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MORPHOGEN-ENRICHED DIETARY COMPOSITION 



Field of the Invention 

This invention relates generally to the field of 
5 dietary compositions and supplements. 

Background of the Invention 

The present invention relates to compositions 

10 useful as mammalian dietary compositions and 

supplements. In particular, the invention relates to 
food additives and dietary supplements capable of 
enhancing tissue morphogenesis and development, 
particularly in individuals at risk for normal tissue 

15 development and viability. Examples of such 

individuals include infants, particularly prematurely- 
born ( "preterm" ) and low birth weight infants , and 
juveniles; aged individuals; and individuals 
experiencing altered metabolic function and/or 

20 suffering from metabolic dysfunctions and other 

disorders that threaten organ or tissue function or 
viability, such as can result from malnutrition or 
starvation, autoimmune diseases, organ cirrhosis and 
other tissue necrotizing dysfunctions, or disorders 

25 associated with aging cells (cell senescence.) 

Mammalian infants are nourished by mother's milk 
until such time as they can digest food solids. Infant 
formulas now exist for humans and other mammals which 
30 can supplant or supplement mother's milk. The formulas 
may be milk based (e.g., cow milk) or non-milk-based 
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(e.g., soy). Particularly at risk are prematurely 
born infants whose tissues and organs are at an earlier 
stage of development, and whose nutritional 
requirements may differ from those of full term 
5 infants. Formula development is an ongoing endeavor to 
more accurately mimic the beneficial aspects of 
mother's milk. Nevertheless, despite the efforts of 
many researchers, infant formulas still differ in a 
number of significant ways from human milk. In part 

10 this is due because human milk has many substances, such 
as immunoglobulins, free amino acids, polyamines, 
nucleotides and polyunsaturated fatty acids not 
present, for example, in cow's milk. In addition, 
while infant formulas try to mimic the protein quantity 

15 found in human milk, the foreign proteins typically are 
present in the formula as hydrolysates to avoid 
rejection or reaction by the infant's digestive system. 
The proteins are present primarily as amino acid 
sources rather than as functional proteins as might 

20 normally be transmitted by the nursing mother to the 
infant. In addition, human milk may contain 
unidentified growth and differentiation factors that 
are important for overall tissue and skeletal 
development. 

25 

Another group of individuals with potentially 
unique nutritional requirements are individuals 
undergoing metabolic changes which may result from 
periods of intense growth or stress, including, for 
30 example, pregnant women and drowning victims. Other 
sources of stress to the body may result from 
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malnutrition or starvation, or from metabolic disorders 
that affect organ viability , such as autoimmune disease 
and organ cirrhosis. Aged individuals, and 
postmenopausal women also have altered or slower 
5 metabolic function. All of these individuals are at 
risk for tissue damage or loss of tissue function due 
to altered metabolic function. 

Reduced or lost tissue function due to malnutrition 
also is found in many patients admitted to hospitals 
(protein energy malnutrition, "PEM" ). Proper 
nutritional support for such patients, while not a 
primary mode of treatment is, nevertheless, an 
important factor for therapy and recovery. It is, 
therefore important to administer a nutritionally 
balanced diet given orally, enterally or parenterally, 
adequate to the needs of the patient. This is 
especially true for those patients where conventional 
feeding is contraindicated (e.g., in dehydrated or 
gastroenterological patients) or is insufficient (e.g., 
in hypercatabolic patients ) . The enteral or oral mode 
of administration of foods typically is preferable to 
parenteral modes because of the lower morbidity, 
trophic effect upon the intestinal mucosa, reduced 
dependency on instrumentation and lower costs. 

It is an object of this invention to provide 
dietary compositions and supplements for enhancing 
tissue morphogenesis, including tissue growth, 
30 development, maintenance and viability in a mammal, 
particularly a human. Another object of the invention 
is to provide an infant formula capable of enhancing 
tissue development in an infant or juvenile. Still 
another object is to provide an an infant formula that 
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more closely mimics a nursing mother's milk. Another 
object of the invention is to provide dietary 
supplements for individuals at risk for normal tissue 
development, growth, maintenance and viability, 
5 including premature infants, aged individuals and 
individuals with altered metabolic function and/or 
suffering from disorders or metabolic dysfunctions 
which threaten organ viability and function. These and 
other objects and features of the invention will be 
10 apparent from the description, drawings, and claims 
which follow. 

Summary of the Invention 

15 The present invention provides compositions and 

methods useful in dietary applications and capable of 
enhancing tissue morphogenesis, including tissue 
growth, development, maintenance and viability in a 
mammal , particularly a human. The dietary compositions 

20 and supplements of this invention comprise a 

morphogenic protein ( "morphogen" ) , as described herein, 
which, when provided to an individual as a food 
formulation or supplement, is capable of enhancing 
tissue development, growth, maintenance and/or 

25 viability in the individual. The compositions and 

processes provided herein are suitable for both infants 
and adults, and as part of clinical nutrition. 

As used herein, "enhancing tissue viability" is 
30 understood to mean protecting tissue from lost or 
reduced tissue function due to cell damage or cell 
senescence, including inducing cells to maintain their 
differentiated phenotype, inducing regeneration of 
damaged tissue, and/or inhibiting additional damage 
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thereto. "Morphogenically effective concentration" is 
understood to mean a concentration sufficient to 
enhance tissue development and tissue viability in an 
individual at risk for tissue damage and/or reduced or 
5 lost tissue function due to insufficient nutritional 
considerations, tissue damage associated therewith, 
and/or incomplete tissue development, regardless of 
etiology. The ability of morphogens to repair, 
regenerate and protect various disparate tissues, 

10 including but not limited to, tissues of the 

gastrointestinal tract, including the oral mucosa, 
liver tissue, dentin tissue, periodontal tissue, nerve 
tissue, bone tissue, and any tissue at risk of damage 
due to immune response-mediated tissue destruction, 

15 including ischemia-reperfusion related tissue damage 
are disclosed in international applications US 92/01968 
(WO 92/15323), US 92/07358 (WO 93/04692) and US 
92/07232 (WO 93/05751) respectively, the disclosures of 
which are incorporated herein by reference. 

20 "Morphogen-solubilizing molecule" is understood to mean 
a molecule capable of maintaining a raorphogen in 
soluble form in physiologically buffered solutions. 
"Food formulation" is understood to mean a dietary 
composition normally ingested by an individual to 

25 satisfy the body's fundamental nutritional 

requirements; "dietary supplement" is understood to 
mean supplemental compositions ingested by an 
individual in addition to the food formulations 
ingested to satisfy the fundamental nutritional 

30 requirements. Multivitamin and iron tablets are 
examples of commmon dietary supplements. "Dietary 
composition" is understood to include both food 
formulations and dietary supplements. As used herein, 
the term "infant formula" is understood to refer to the 
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well established infant compositions as defined by the 
American Academy of Pediatrics (AAP) and the AAP 
Committee on Nutrition ((1985) Pediatrics 75; 976, the 
European Society of Pediatric Gastroenterology and 
5 Nutrition (ESPGAN) and the ESPGAN Committee on 

Nutrition ((1987) Acta Paed Scan Suppl ; 330 ), including 
recent updates published by these committees on infant 
formula nutritional guidelines. 

10 The dietary composition or supplement preferably is 

administered orally, and may be provided in liquid form 
or as a powder to be dissolved in a beverage. 
Alternatively, the dietary supplement may be provided 
as a solid, e.g., in a capsular, tablet, troche or 

15 lozenge form; or, the supplement may be provided as an 
aerosol, for oral or nasal administration. Where oral 
administration is not possible or desirable, other 
administration routes are envisioned. For example, for 
some premature infants, or for intibated patients, 

20 parenteral administration may be required, e.g., via an 
enteral feeding tube. 

The morphogen may be provided alone or in 
association with one or more suitable excipients or 

25 carriers, and/or in combination with other beneficial 
molecules such as vitamins, minerals, lipids, fiber 
sources and the like. The dietary supplements also may 
include pharmaceutical^ acceptable inert materials for 
use as binders or stabilizers, including magnesium 

30 stearate or calcium carbonate. The morphogen may be 
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formulated together with one or more normal food 
ingredients, e.g., as part of a food formulation. 
Alternatively or, in addition, the morphogen may be 
provided as a dietary supplement in, for example, 
5 tablet or syrup form. 

The mature form of the morphogen, or active 1 
truncated forms thereof which may be formulated in the 
composition, further may be provided in association 

10 with a morphogen precursor "pro" domain, which is known 
to enhance the solubility of the protein in 
physiologically buffered solutions. Other useful 
molecules known to enhance protein solubility include 
casein, including derivatives, salts and analogs 

15 thereof, as well as other milk components, and various 
serum and milk serum proteins. Additional useful 
molecules which may be associated with the morphogen 
include tissue targeting molecules capable of directing 
the morphogen to a desired target tissue. Tissue 

20 targeting molecules envisioned to be useful in the 
treatment protocols of this invention include 
antibodies, antibody fragments or other binding 
proteins which interact specifically with surface 
molecules on the target tissue cells* 

25 

Still another useful tissue targeting molecule may 
be part or all of a morphogen precursor "pro" domain. 
Morphogens may be synthesized in one tissue and 
secreted and transported to another tissue. For 
30 example, while the protein has been shown to be active 
in bone tissue, the primary source of OP-1 synthesis 
appears to be the tissue of the urogenic system (e.g., 
renal and bladder tissue), with secondary expression 
levels occurring in the brain, heart, lungs and 
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gastrointestinal tract (GI tract, see below.) 
Moreover, the protein has been identified in serum, 
saliva and various milk forms. In addition, the 
secreted form of the protein comprises the mature dimer 
5 in association with the pro domain of the intact 
morphogen sequence. Accordingly, the associated 
morphogen pro domains may act to target specific 
morphogens to different tissues in vivo . As described 
below, morphogen species comprising the pro domain may 
10 be obtained from the culture medium of raorphogen- 
secreting mammalian cells. Alternatively, a tissue- 
targeting species may be formulated by complexing the 
mature dimer (or an active fragment thereof) with part 
or all of a pro domain. 

15 

Associated tissue targeting or solubility-enhancing 
molecules also may be covalently linked to the 
morphogen using standard chemical means. 

20 In one preferred embodiment, the morphogen 

comprises part of an infant formula. The infant formula 
may be milk-based or nonrailk-based, e.g., soy-based. A 
typical ready-to-feed morphogen-enriched formulation 
for infants, when diluted to feeding concentrations, 

25 comprises, in addition to the morphogen added to the 

formula, from about 1-5% by weight fat, from about 0.01 
to about 0.5% by weight immunoglobulins as appropriate, 
from about 4-10% by weight carbohydrate in a quantity 
substantially to mimic the carbohydrate content of 

30 human mother's milk, from about 0.5 to 4% by weight 
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protein in a quantity substantially to mimic the 
protein content of human mother's milk, optional 
vitamins and minerals as required, a total solids 
content of from about 8 to 17% by weight, and the 
5 remainder water . 

In another preferred embodiment, the dietary 
composition is formulated for individuals at risk for 
reduced or lost tissue function, such as postmenopausal 

10 women, elderly individuals, undernourished or 

malnourished individuals, dehydrated individuals, 
drowning victims, individuals suffering from metabolic 
disorders including an endocrine imbalance, 
gastrointestinal disorders, or immune-compromised 

15 individuals. Undernourished or malnourished 

individuals include those suffering from a lack of food 
(starvation) and/or eating disorders (e.g., anorexia 
nervosa), and/or suffering from a maladsorption 
syndrome (e.g., individuals afflicted with digestive or 

20 intestinal fistulas, shortened bowel, or 

hypercatabolism. ) Individuals receiving a medical 
therapy, including radiotherapy, chemotherapy or a 
surgical procedure also are at risk for reduced or lost 
tissue function as a result of a therapy-related 

25 malabsorption-malnutrition dysfunction. In another 
embodiment, the dietary supplement is formulated for 
individuals undergoing periods of increased growth or 
stress, such as infants and juveniles, or pregnant or 
lactating women. In another embodiment, the dietary 

30 supplement is formulated for individuals at risk for 
reduced or lost organ function as results from tissue 
cirrhosis or an autoimmune disease. 



WO 94/03075 



PCT/US93/07190 



- 10 - 

Morphogen-enriched nutritional products, 
particularly clinical nutrition products for use in 
hospital or other clinical settings , in addition to 
comprising a morphogen preferably are based on the 
5 utilization of diverse other protein sources ( casein , 
sodium and calcium caseinate, isolated soy protein, 
protein hydrolyzates and/or crystalline amino acids) 
mixtures of vegetable and animal fats, carbohydrates 
(basically glucose polymers), vitamins and minerals to 
10 meet, at least, the dietary intakes recommended for 
healthy individuals (see, for example, Committee on 
Dietary Allowances, Food and Nutrition Board, Nat Acad 
Sci, 9th Ed, 1980). 

15 Among the morphogens useful in this invention are 

proteins originally identified as osteogenic proteins, 
such as the OP-1, OP-2 and CBMP2 proteins, as well as 
amino acid sequence-related proteins such as DPP (from 
Drosophila), Vgl (from Xenopus), Vgr-1 (from mouse, see 

20 U.S. 5,011,691 to Oppermann et al.), GDF-1 (from mouse, 
see Lee (1991) PNAS 88:4250-4254), all of which are 
presented in Table II and Seq. ID Nos.5-14), and the 
recently identified 60A protein (from Drosophila, Seq. 
ID No. 24, see Wharton et al. (1991) PNAS 

25 88:9214-9218.) The members of this family, which 

include members of the TGF-p super-family of proteins, 
share substantial amino acid sequence homology in their 
C-terminal regions. The proteins are translated as a 
precursor, having an N- terminal signal peptide 

30 sequence, typically less than about 30 residues, 

followed by a "pro" domain that is cleaved to yield the 
mature sequence. The "pro" form of the protein 
includes the pro domain and the mature domain, and 
forms a soluble species that "appears to be the primary 
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form secreted from cultured mammalian cells. The 
signal peptide is cleaved rapidly upon translation, at 
a cleavage site that can be predicted in a given 
sequence using the method of Von Heijne ((1986) Nucleic 
5 Acids Research 14x4683-4691.) Table I, below, 

describes the various morphogens identified to date, 
including their nomenclature as used herein, their Seq. 
ID references, and publication sources for the amino 
acid sequences for the full length proteins not 
10 included in the Seq. Listing. The disclosure of these 
publications is incorporated herein by reference. 

TABLE I 

15 "OP-l" Refers generically to the group of 

morphogenically active proteins expressed 
from part or all of a DNA sequence 
encoding OP-1 protein, including allelic 
and species variants thereof, e.g., human 

20 OP-1 ("hOP-l", Seq. ID No. 5, mature 

protein amino acid sequence), or mouse 
OP-1 ("raOP-l n , Seq. ID No. 6, mature 
protein amino acid sequence.) The 
conserved seven cysteine skeleton is 

25 defined by residues 38 to 139 of Seq. ID 

Nos. 5 and 6. The cDNA sequences and the 
amino acids encoding the full length 
proteins are provided in Seq. Id Nos. 16 
and 17 (hOPl) and Seq. ID Nos. 18 and 19 

30 (mOPl.) The mature proteins are defined 
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by residues 293-431 (hOPl) and 292-430 
(mOPl). The "pro" regions of the 
proteins, cleaved to yield the mature, 
morphogenically active proteins are 
defined essentially by residues 30-292 
(hOPl) and residues 30-291 (mOPl). 

refers generically to the group of active 
proteins expressed from part or all of a 
DNA sequence encoding OP-2 protein, 
including allelic and species variants 
thereof, e.g., human OP-2 ("hOP-2", Seq. 
ID No. 7, mature protein amino acid 
sequence) or mouse OP-2 ("mOP-2 n , Seq. ID 
No. 8, mature protein amino acid 
sequence). The conserved seven cysteine 
skeleton is defined by residues 38 to 139 
of Seq. ID Nos. 7 and 8. The cDNA 
sequences and the amino acids encoding the 
full length proteins are provided in Seq. 
ID Nos. 20 and 21 (hOP2) and Seq. ID Nos. 
22 and 23 (mOP2.) The mature proteins are 
defined essentially by residues 264-402 
(hOP2) and 261-399 (mOP2). The "pro" 
regions of the proteins, cleaved to yield 
the mature, morphogenically active 
proteins likely are defined essentially by 
residues 18-263 (hOP2) and residues 18-260 
(mOP2). (Another cleavage site also 
occurs 21 residues upstream for both OP-2 
proteins • ) 
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"CBMP2 n refers generically to the morphogenically 

active proteins expressed from a DNA 
sequence encoding the CBMP2 proteins, 
including allelic and species variants 
5 thereof, e.g., human CBMP2A ( "CBMP2A(fx) 

Seg ID No. 9) or human CBMP2B DNA 
("CBMP2B(fx) M , Seq. ID No. 10). The amino 
acid sequence for the full length 
proteins, referred to in the literature as 

10 BMP2A and BMP2B, or BMP 2 and BMP4, appear 

in Wozney, et al. (1988) Science 242:1528- 
1534. The pro domain for BMP 2 (BMP2A) 
likely includes residues 25-248 or 25-282; 
the mature protein, residues 249-396 or 

15 283-396. The pro domain for BMP 4 (BMP2B) 

likely includes residues 25-256 or 25-292; 
the mature protein, residues 257-408 or 
293-408. 

20 "DPP(fx)" refers to protein sequences encoded by the 

Drosophila DPP gene and defining the 
conserved seven cysteine skeleton (Seq. ID 
No. 11). The amino acid sequence for the 
full length protein appears in Padgett, et 

25 al (1987) Nature 325: 81-84. The pro 

domain likely extends from the signal 
peptide cleavage site to residue 456; the 
mature protein likely is defined by 
residues 457-588. 

30 

"Vgl(fx) ,f refers to protein sequences encoded by the 
Xenopus Vgl gene and defining the 
conserved seven cysteine skeleton (Seq. ID 
No. 12). The amino acid sequence for the 
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full length protein appears in 
Weeks (1987) Cell 51; 861-867. The 
prodomain likely extends from the signal 
peptide cleavage site to residue 246; the 
5 mature protein likely is defined by 

residues 247-360. 

"Vgr-l(fx) " refers to protein sequences encoded by the 
murine Vgr-1 gene and defining the 

10 conserved seven cysteine skeleton (Seq. ID 

No. 13). The amino acid sequence for the 
full length protein appears in Lyons , et 
al, (1989) PNAS 86; 4554-4558. The 
prodomain likely extends from the signal 

15 peptide cleavage site to residue 299; the 

mature protein likely is defined by 
residues 300-438. 

"GDF-l(fx)" refers to protein sequences encoded by the 
20 human GDF-1 gene and defining the 

conserved seven cysteine skeleton (Seq. ID 
No. 14). The cDNA and encoded amino 
sequence for the full length protein is 
provided in Seq. ID. No. 32. The 
25 prodomain likely extends from the signal 

peptide clavage site to residue 214; the 
mature protein likely is defined by 
residues 215-372. 

30 "60A" refers generically to the morphogenically 

active proteins expressed from part or all 
of a DNA sequence (from the Drosophila 6 OA 
gene) encoding the 60A proteins (see Seq. 
ID No. 24 wherein the cDNA and encoded 
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amino acid sequence for the full length 
protein is provided). "60A(fx)" refers to 
the protein sequences defining the 
conserved seven cysteine skeleton 
5 (residues 354 to 455 of Seq. ID No. 24.) 

The prodomain likely extends from the 
signal peptide cleavage site to residue 
324; the mature protein likely is defined 
by residues 325-455. 

10 

"BMP3(fx) n refers to protein sequences encoded by the 
human BMP 3 gene and defining the conserved 
seven cysteine skeleton (Seq. ID No. 26). 
The amino acid sequence for the full 

15 length protein appears in Wozney et al. 

(1988) Science 242 : 1528-1534. The pro 
domain likely extends from the signal 
peptide cleavage site to residue 290; the 
mature protein likely is defined by 

20 residues 291-472. 

M BMP5(fx)" refers to protein sequences encoded by the 
human BMP 5 gene and defining the conserved 
seven cysteine skeleton (Seq. ID No. 27). 

25 The amino acid sequence for the full 

length protein appears in Celeste, et al. 
(1991) PNAS 87 ; 9843-9847. The pro domain 
likely extends from the signal peptide 
cleavage site to residue 316; the mature 

30 protein likely is defined by residues 

317-454. 
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"BMP6(fx) n refers to protein sequences encoded by the 
human BMP6 gene and defining the conserved 
seven cysteine skeleton (Seq. ID No. 28). 
The amino acid sequence for the full 
5 length protein appears in Celeste , et al. 

(1990) PNAS 87: 9843-5847. The pro domain 
likely includes extends from the signal 
peptide cleavage site to residue 374; the 
mature sequence likely includes 
10 residues 375-513. 



The OP-2 proteins have an additional cysteine 
residue in this region (e.g., see residue 41 of Seq. ID 

15 Nos. 7 and 8), in addition to the conserved cysteine 
skeleton in common with the other proteins in this 
family. The GDF-1 protein has a four amino acid insert 
within the conserved skeleton (residues 44-47 of Seq. 
ID No. 14) but this insert likely does not interfere 

20 with the relationship of the cysteines in the folded 

structure. In addition, the CBMP2 proteins are missing 
one amino acid residue within the cysteine skeleton. 

The morphogens are inactive when reduced, but are 
25 active as oxidized homodimers and when oxidized in 
combination with other morphogens of this invention. 
Thus, as defined herein, a morphogen is a dimeric 
protein comprising a pair of polypeptide chains, 
wherein each polypeptide chain comprises at least the 
30 C-terminal six cysteine skeleton defined by residues 
43-139 of Seq. ID No. 5, including functionally 
equivalent arrangements of these cysteines (e.g., amino 
acid insertions or deletions which alter the linear 
arrangement of the cysteines in the sequence but not 
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their relationship in the folded structure), such that, 
when the polypeptide chains are folded, the dimeric 
protein species comprising the pair of polypeptide 
chains has the appropriate three-dimensional structure, 
5 including the appropriate intra- or inter-chain 
disulfide bonds such that the protein is capable of 
acting as a morphogen as defined herein. Specifically, 
the morphogens generally are capable of all of the 
following biological functions in a morphogenically 

10 permissive environment: stimulating proliferation of 
progenitor cells; stimulating the differentiation of 
progenitor cells; stimulating the proliferation of 
differentiated cells, and supporting the growth and 
maintenance of differentiated cells. In addition, it 

15 is also anticipated that these morphogens are capable 
of inducing redif ferentiation of committed cells under 
appropriate environmental conditions. 

In one preferred aspect, the morphogens of 
20 this invention comprise one of two species of generic 
amino acid sequences: Generic Sequence 1 (Seq. ID 
No. 1) or Generic Sequence 2 (Seq. ID No. 2); where 
each Xaa indicates one of the 20 naturally-occurring 
L-isomer, a-amino acids or a derivative thereof. 
25 Generic Sequence 1 comprises the conserved six cysteine 
skeleton and Generic Sequence 2 comprises the conserved 
six cysteine skeleton plus the additional cysteine 
identified in OP-2 (see residue 36, Seq. ID No. 2). In 
another preferred aspect, these sequences further 
30 comprise the following additional sequence at their N- 
terminus : 

Cys Xaa Xaa Xaa Xaa (Seq. ID No. 15) 
1 5 



35 
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Preferred amino acid sequences within the foregoing 
generic sequences include: Generic Sequence 3 (Seq. ID 
No. 3) f Generic Sequence 4 (Seq, ID No. 4), Generic 
Sequence 5 (Seq. ID No. 30) and Generic Sequence 6 
5 (Seq. ID No. 31) , listed below. These Generic 

Sequences accommodate the homologies shared among the 
various preferred members of this morphogen family 
identified in Table II , as well as the amino acid 
sequence variation among them. Specifically/ Generic 

10 Sequences 3 and 4 are composite amino acid sequences of 
the following proteins presented in Table II and 
identified in Seq. ID Nos. 5-14: human OP-1 (hOP-1, 
Seq. ID Nos. 5 and 16-17 ), mouse OP-1 (mOP-1, Seq. ID 
Nos. 6 and 18-19), human and mouse OP-2 (Seq. ID 

15 Nos. 7, 8, and 20-22), CBMP2A (Seq. ID No. 9), CBMP2B 
(Seq. ID No. 10), DPP (from Drosophila, Seq. ID 
No. 11), Vgl, (from Xenopus, Seq. ID No. 12), Vgr-1 
(from mouse, Seq. ID No. 13), and GDP-1 (from mouse, 
Seq. ID No. 14.) The generic sequences include both 

20 the amino acid identity shared by the sequences in 
Table II, as well as alternative residues for the 
variable positions within the sequence. Note that 
these generic sequences allow for an additional 
cysteine at position 41 or 46 in Generic Sequences 3 or 

25 4, respectively, providing an appropriate cysteine 

skeleton where inter- or intramolecular disulfide bonds 
can form, and contain certain critical amino acids 
which influence the tertiary structure of the proteins. 
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Generic Sequence 3 
Leu Tyr Val Xaa Phe 

1 5 

Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

5 10 

Xaa Ala Pro Xaa Gly Xaa Xaa Ala 

15 20 

Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 

25 30 

10 Xaa Pro Xaa Xaa Xaa Xaa Xaa 

35 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 

40 45 
Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 
15 50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
65 

20 Xaa Xaa Xaa Leu Xaa Xaa Xaa 

70 75 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
80 

Xaa Xaa Xaa Xaa Met xaa Val Xaa 
25 85 90 
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Xaa Cys Gly Cys Xaa 
95 

wherein each Xaa is independently selected from a group 
of one or more specified amino acids defined as 
5 follows: "Res." means "residue" and Xaa at res. 4 = 
(Ser, Asp or Glu); Xaa at res. 6 ■ (Arg, Gin, Ser or 
Lys); Xaa at res. 7 = (Asp or Glu); Xaa at res. 8 = (Leu 
or Val); Xaa at res. 11 - (Gin, Leu, Asp, His or Asn); 
Xaa at res. 12 = (Asp, Arg or Asn); Xaa at res. 14 = (He 

10 or Val); Xaa at res. 15 = (He or Val); Xaa at res. 18 = 
(Glu, Gin, Leu, Lys, Pro or Arg); Xaa at res. 20 « (Tyr 
or Phe); Xaa at res. 21 « (Ala, Ser, Asp, Met, His, Leu 
or Gin); Xaa at res. 23 - (Tyr, Asn or Phe); Xaa at 
res. 26 = (Glu, His, Tyr, Asp or Gin); Xaa at res. 28 « 

15 (Glu, Lys, Asp or Gin); Xaa at res. 30 = (Ala, Ser, Pro 
or Gin); Xaa at res. 31 « (Phe, Leu or Tyr); Xaa at 
res. 33 = (Leu or Val); Xaa at res. 34 - (Asn, Asp, Ala 
or Thr); Xaa at res. 35 - (Ser, Asp, Glu, Leu or Ala); 
Xaa at res. 36 ■ (Tyr, Cys, His, Ser or He); Xaa at 

20 res. 37 « (Met, Phe, Gly or Leu); Xaa at res. 38 ~ (Asn 
or Ser); Xaa at res. 39 = (Ala, Ser or Gly); Xaa at 
res. 40 = (Thr, Leu or Ser); Xaa at res. 44 = (He or 
Val); Xaa at res. 45 « (Val or Leu); Xaa at res. 46 = 
(Gin or Arg); Xaa at res. 47 = (Thr, Ala or Ser); Xaa at 

25 res. 49 = (Val or Met); Xaa at res. 50 = (His or Asn); 
Xaa at res. 51 « (Phe, Leu, Asn, Ser, Ala or Val); Xaa 
at res. 52 » (He, Met, Asn, Ala or Val); Xaa at res. 53 
= (Asn, Lys, Ala or Glu); Xaa at res. 54 = (Pro or Ser); 
Xaa at res. 55 = (Glu, Asp, Asn, or Gly); Xaa at res. 56 

30 * (Thr, Ala, Val, Lys, Asp, Tyr, Ser or Ala); Xaa at 
res. 57 = (Val, Ala or He); Xaa at res. 58 = (Pro or 
Asp); Xaa at res. 59 = (Lys or Leu); Xaa at res. 60 = 
(Pro or Ala); Xaa at res. 63 » (Ala or Val); Xaa at 
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res. 65 = (Thr or Ala); Xaa at res. 66 = (Gln f Lys, Arg 
or Glu); Xaa at res. 67 = (Leu, Met or Val); Xaa at 
res. 68 = (Asn, Ser or Asp); Xaa at res. 69 = (Ala, Pro 
or Ser); Xaa at res. 70 = (He, Thr or Val); Xaa at 
5 res. 71 « (Ser or Ala); Xaa at res. 72 ■ (Val or Met); 
Xaa at res. 74 = (Tyr or Phe); Xaa at res. 75 = (Phe, Tyr 
or Leu); Xaa at res. 76 = (Asp or Asn); Xaa at res. 77 = 
(Asp, Glu, Asn or Ser); Xaa at res. 78 = (Ser, Gin, Asn 
or Tyr); Xaa at res. 79 = (Ser, Asn, Asp or Glu); Xaa at 

10 res. 80 = (Asn, Thr or Lys); Xaa at res. 82 = (lie or 
Val); Xaa at res. 84 * (Lys or Arg); Xaa at res. 85 = 
(Lys, Asn, Gin or His); Xaa at res. 86 = (Tyr or His); 
Xaa at res. 87 » (Arg, Gin or Glu); Xaa at res. 88 = 
(Asn, Glu or Asp); Xaa at res. 90 = (Val, Thr or Ala); 

15 Xaa at res. 92 = (Arg, Lys, Val, Asp or Glu); Xaa at 

res. 93 = (Ala, Gly or Glu); and Xaa at res. 97 = (His or 
Arg); 

Generic Sequence 4 

20 

Cys Xaa Xaa Xaa Xaa Leu Tyr Val Xaa Phe 
1 5 10 

Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

15 

25 Xaa Ala Pro Xaa Gly Xaa Xaa Ala 

20 25 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
30 35 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 
30 40 
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Xaa Xaa Xaa Asn His Ala Xaa Xaa 
45 50 
Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 

55 

5 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

60 65 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
70 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 
10 75 80 

Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
85 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 
90 95 
15 Xaa Cys Gly Cys Xaa 

100 

wherein each Xaa is independently selected from a group 
of one or more specified amino acids as defined by the 
following: "Res." means "residue" and Xaa at res. 2 = 

20 (Lys or Arg); Xaa at res. 3 = (Lys or Arg); Xaa at res. 4 
= (His or Arg); Xaa at res. 5 = (Glu, Ser, His, Gly, Arg 
or Pro); Xaa at res. 9 = (Ser, Asp or Glu); Xaa at 
res. 11 « (Arg, Gin, Ser or Lys); Xaa at res. 12 = (Asp 
or Glu); Xaa at res. 13 = (Leu or Val); Xaa at res. 16 = 

25 (Gin, Leu, Asp, His or Asn); Xaa at res. 17 = (Asp, Arg, 
or Asn); Xaa at res. 19 » (lie or Val); Xaa at res. 20 = 
(lie or Val); Xaa at res. 23 = (Glu, Gin, Leu, Lys, Pro 
or Arg); Xaa at res. 25 *= (Tyr or Phe); Xaa at res. 26 = 
(Ala, Ser, Asp, Met, His, Leu, or Gin); Xaa at res. 28 = 

30 (Tyr, Asn or Phe); Xaa at res. 31 = (Glu, His, Tyr, Asp 
or Gin); Xaa at res. 33 « Glu, Lys, Asp or Gin); Xaa at 
res. 35 = (Ala, Ser or Pro); Xaa at res. 36 = (Phe, Leu 
or Tyr); Xaa at res. 38 = (Leu or Val); Xaa at res. 39 = 
(Asn, Asp, Ala or Thr); Xaa at res. 40 = (Ser, Asp, Glu, 
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Leu or Ala); Xaa at res ,41 ■ (Tyr, Cys, His, Ser or 
He); Xaa at res. 42 = (Met, Phe, Gly or Leu); Xaa at 
res. 44 = (Ala, Ser or Gly); Xaa at res. 45 » (Thr, Leu 
or Ser); Xaa at res. 49 = (He or Val); Xaa at res. 50 = 
5 (Val or Leu); Xaa at res. 51 = (Gin or Arg); Xaa at 
res. 52 * (Thr, Ala or Ser); Xaa at res. 54 = (Val or 
Met); Xaa at res. 55 = (His or Asn); Xaa at res. 56 = 
(Phe, Leu, Asn, Ser, Ala or Val); Xaa at res. 57 = (He, 
Met, Asn, Ala or Val); Xaa at res. 58 « (Asn, Lys, Ala 

10 or Glu); Xaa at res. 59 ■ (Pro or Ser); Xaa at res. 60 = 
(Glu, Asp, or Gly); Xaa at res. 61 = (Thr, Ala, Val, 
Lys, Asp, Tyr, Ser or Ala); Xaa at res. 62 = (Val, Ala 
or He); Xaa at res. 63 « (Pro or Asp); Xaa at res. 64 = 
(Lys or Leu); Xaa at res. 65 = (Pro or Ala); Xaa at 

15 res. 68 = (Ala or Val); Xaa at res. 70 « (Thr or Ala); 

Xaa at res. 71 » (Gin, Lys, Arg or Glu); Xaa at res. 72 = 
(Leu, Met or Val); Xaa at res. 73 = (Asn, Ser or Asp); 
Xaa at res. 74 « (Ala, Pro or Ser); Xaa at res. 75 « 
(He, Thr or Val); Xaa at res. 76 » (Ser or Ala); Xaa at 

20 res. 77 = (Val or Met); Xaa at res. 79 = (Tyr or Phe); 

Xaa at res. 80 = (Phe, Tyr or Leu); Xaa at res. 81 = (Asp 
or Asn); Xaa at res. 82 » (Asp, Glu, Asn or Ser); Xaa at 
res. 83 = (Ser, Gin, Asn or Tyr); Xaa at res. 84 « (Ser, 
Asn, Asp or Glu); Xaa at res. 85 = (Asn, Thr or Lys); 

25 Xaa at res. 87 = (He or Val); Xaa at res. 89 = (Lys or 
Arg); Xaa at res. 90 « (Lys, Asn, Gin or His); Xaa at 
res. 91 = (Tyr or His); Xaa at res. 92 « (Arg, Gin or 
Glu); Xaa at res. 93 = (Asn, Glu or Asp); Xaa at res. 95 
= (Val, Thr or Ala); Xaa at res. 97 « (Arg, Lys, Val, 

30 Asp or Glu); Xaa at res. 98 « (Ala, Gly or Glu); and Xaa 
at res. 102 « (His or Arg). 
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Similarly, Generic Sequence 5 (Seq. ID No. 30) and 
Generic Sequence 6 (Seq. ID No. 31) accommodate the 
homologies shared among all the morphogen protein 
family members identified in Table II. Specifically, 
5 Generic Sequences 5 and 6 are composite amino acid 
sequences of human OP-1 (hOP-1, Seq. ID Nos. 5 and 16- 
17), mouse OP-1 (mOP-1, Seq. ID Nos. 6 and 18-19), 
human and mouse OP-2 (Seq. ID Nos. 7, 8, and 20-22), 
CBMP2A (Seq. ID No. 9), CBMP2B (Seq. ID No. 10), DPP 

10 (from Drosophila, Seq. ID No. 11), Vgl, (from Xenopus, 
Seq. ID No. 12), Vgr-1 (from mouse, Seq. ID No. 13), 
and GDF-1 (from mouse, Seq. ID No. 14), human BMP 3 
(Seq. ID No. 26), human BMP 5 (Seq. ID No. 27), human 
BMP 6 (Seq. ID No. 28) and 60(A) (from Drosophila, Seq. 

15 ID Nos. 24-25). The generic sequences include both the 
amino acid identity shared by these sequences in the 
C-terminal domain, defined by the six and seven 
cysteine skeletons (Generic Sequences 5 and 6, 
respectively), as well as alternative residues for the 

20 variable positions within the sequence. As for Generic 
Sequences 3 and 4, Generic Sequences 5 and 6 allow for 
an additional cysteine at position 41 (Generic Sequence 
5) or position 46 (Generic Sequence 6), providing an 
appropriate cysteine skeleton where inter- or 

25 intramolecular disulfide bonds can form, and containing 
certain critical amino acids which influence the 
tertiary structure of the proteins. 
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Generic Seauence 5 

Leu Xaa Xaa Xaa Phe 
1 5 
5 Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

10 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala 

15 20 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
10 25 30 

Xaa Pro Xaa Xaa Xaa Xaa Xaa 

35 

Xaa Xaa Xaa Asn Bis Ala Xaa Xaa 
40 45 
15 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
20 65 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 
70 75 
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Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
80 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 
85 90 
5 Xaa Cys Xaa Cys Xaa 

95 

wherein each Xaa is independently selected from a group 
of one or more specified amino acids defined as 
follows: "Res." means "residue" and Xaa at res. 2 = 

10 (Tyr or Lys); Xaa at res. 3 = Val or He); Xaa at res. 4 
= (Ser, Asp or Glu); Xaa at res. 6 = (Arg, Gin, Ser, Lys 
or Ala); Xaa at res.7 ■ (Asp, Glu or Lys); Xaa at res. 8 
- (Leu, Val or He); Xaa at res. 11 = (Gin, Leu, Asp, 
His, Asn or Ser); Xaa at res. 12 ■ (Asp, Arg, Asn or 

15 Glu); Xaa at res. 14 - (He or Val); Xaa at res. 15 = 

(He or Val); Xaa at res. 16 (Ala or Ser); Xaa at res. 18 
= (Glu, Gin, Leu, Lys, Pro or Arg); Xaa at res. 19 = 
(Gly or Ser); Xaa at res. 20 = (Tyr or Phe); Xaa at 
res. 21 = (Ala, Ser, Asp, Met, His, Gin, Leu or Gly); 

20 Xaa at res. 23 = (Tyr, Asn or Phe); Xaa at res. 26 = 

(Glu, His, Tyr, Asp, Gin or Ser); Xaa at res. 28 « (Glu, 
Lys, Asp, Gin or Ala); Xaa at res. 30 « (Ala, Ser, Pro, 
Gin or Asn); Xaa at res. 31 - (Phe, Leu or Tyr); Xaa at 
res. 33 » (Leu, Val or Met); Xaa at res. 34 « (Asn, Asp, 

25 Ala, Thr or Pro); Xaa at res. 35 « (Ser, Asp, Glu, Leu, 
Ala or Lys); Xaa at res. 36 = (Tyr, Cys, His, Ser or 
He); Xaa at res. 37 = (Met, Phe, Gly or Leu); Xaa at 
res. 38 - (Asn, Ser or Lys); Xaa at res. 39 = (Ala, Ser, 
Gly or Pro); Xaa at res. 40 = (Thr, Leu or Ser); Xaa at 

30 res. 44 = (He, Val or Thr); Xaa at res. 45 = (Val, Leu 
or He); Xaa at res. 46 = (Gin or Arg); Xaa at res. 47 = 
(Thr, Ala or Ser); Xaa at res. 48 = (Leu or He); Xaa at 
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res. 49 « (Val or Met); Xaa at res. 50 = (His, Asn or 
Arg); Xaa at res. 51 = (Phe, Leu, Asn, Ser, Ala or Val); 
Xaa at res. 52 = (He, Met, Asn, Ala, Val or Leu); Xaa 
at res. 53 = (Asn, Lys, Ala, Glu, Gly or Phe); Xaa at 
5 res. 54 = (Pro, Ser or Val); Xaa at res. 55 = (Glu, Asp, 
Asn, Gly, Val or Lys); Xaa at res. 56 = (Thr, Ala, Val, 
Lys, Asp, Tyr, Ser, Ala, Pro or His); Xaa at res. 57 = 
(Val, Ala or He); Xaa at res. 58 « (Pro or Asp); Xaa at 
res. 59 = (Lys, Leu or Glu); Xaa at res. 60 = (Pro or 

10 Ala); Xaa at res. 63 = (Ala or Val); Xaa at res. 65 * 
(Thr, Ala or Glu); Xaa at res. 66 = (Gin, Lys, Arg or 
Glu); Xaa at res. 67 * (Leu, Met or Val); Xaa at res. 68 
= (Asn, Ser, Asp or Gly); Xaa at res. 69 » (Ala, Pro or 
Ser); Xaa at res. 70 = (He, Thr, Val or Leu); Xaa at 

15 res. 71 « (Ser, Ala or Pro); Xaa at res. 72 = (Val, Met 
or He); Xaa at res. 74 * (Tyr or Phe); Xaa at res. 75 = 
(Phe, Tyr, Leu or His); Xaa at res. 76 = (Asp, Asn or 
Leu); Xaa at res. 77 « (Asp, Glu, Asn or Ser); Xaa at 
res. 78 = (Ser, Gin, Asn, Tyr or Asp); Xaa at res. 79 » 

20 (Ser, Asn, Asp, Glu or Lys); Xaa at res. 80 = (Asn, Thr 
or Lys); Xaa at res. 82 - (He, Val or Asn); Xaa at 
res. 84 = (Lys or Arg); Xaa at res. 85 = (Lys, Asn, Gin, 
His or Val); Xaa at res. 86 « (Tyr or His); Xaa at 
res. 87 « (Arg, Gin, Glu or Pro); Xaa at res. 88 « (Asn, 

25 Glu or Asp); Xaa at res. 90 = (Val, Thr, Ala or He); 
Xaa at res. 92 = (Arg, Lys, Val, Asp or Glu); Xaa at 
res. 93 - (Ala, Gly, Glu or Ser); Xaa at res. 95 = (Gly 
or Ala) and Xaa at res. 97 » (His or Arg). 
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Generic Sequence 6 

Cys Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Phe 
15 10 
5 Xaa Xaa Xaa Gly Trp xaa Xaa Trp Xaa 

15 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala 

20 25 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
10 30 35 

Xaa Pro Xaa Xaa Xaa Xaa Xaa 

40 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
45 50 
15 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

55 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

60 65 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
20 70 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 

75 80 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
85 

25 Xaa Xaa Xaa Xaa Met Xaa Val Xaa 

90 95 
Xaa Cys Xaa Cys Xaa 
100 

30 wherein each Xaa is independently selected from a group 
of one or more specified amino acids as defined by the 
following: "Res." means "residue" and Xaa at res. 2 = 
(Lys, Arg, Ala or Gin); Xaa at res. 3 = (Lys, Arg or 
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Met ) ; Xaa at res . 4 - ( His , Arg or Gin ) ; Xaa at res . 5 = 
(Glu, Ser, His, Gly, Arg, Pro, Thr, or Tyr); Xaa at 
res. 7 = (Tyr or Lys); Xaa at res. 8 = (Val or He); Xaa 
at res. 9 » (Ser, Asp or Glu); Xaa at res. 11 ■ (Arg, 
5 Gin, Ser, Lys or Ala); Xaa at res. 12 = (Asp, Glu, or 
Lys); Xaa at res. 13 ■ (Leu, Val or He); Xaa at res. 16 
- (Gin, Leu, Asp, His, Asn or Ser); Xaa at res. 17 - 
(Asp, Arg, Asn or Glu); Xaa at res. 19 « (He or Val); 
Xaa at res. 20 ■ (He or Val); Xaa at res. 21 - (Ala or 

10 Ser); Xaa at res. 23 » (Glu, Gin, Leu, Lys, Pro or Arg); 
Xaa at res. 24 - (Gly or Ser); Xaa at res. 25 = (Tyr or 
Phe); Xaa at res. 26 - (Ala, Ser, Asp, Met, His, Gin, 
Leu, or Gly); Xaa at res. 28 = (Tyr, Asn or Phe); Xaa at 
res. 31 - (Glu, His, Tyr, Asp, Gin or Ser); Xaa at 

15 res. 33 = Glu, Lys, Asp, Gin or Ala); Xaa at res. 35 ■ 
(Ala, Ser, Pro, Gin or Asn); Xaa at res. 36 = (Phe, Leu 
or Tyr); Xaa at res. 38 «= (Leu, Val or Met); Xaa at 
res. 39 - (Asn, Asp, Ala, Thr or Pro); Xaa at res. 40 = 
(Ser, Asp, Glu, Leu, Ala or Lys); Xaa at res. 41 » (Tyr, 

20 Cys, His, Ser or He); Xaa at res. 42 - (Met, Phe, Gly 
or Leu); Xaa at res. 43 - (Asn, Ser or Lys); Xaa at 
res. 44 = (Ala, Ser, Gly or Pro); Xaa at res. 45 = (Thr, 
Leu or Ser); Xaa at res. 49 = (He, Val or Thr); Xaa at 
res. 50 ■ (Val, Leu or He); Xaa at res. 51 - (Gin or 

25 Arg); Xaa at res. 52 - (Thr, Ala or Ser); Xaa at res. 53 
= (Leu or He); Xaa at res. 54 = (Val or Met); Xaa at 
res. 55 = (His, Asn or Arg); Xaa at res. 56 '■ (Phe, Leu, 
Asn, Ser, Ala or Val); Xaa at res. 57 = (He, Met, Asn, 
Ala, Val or Leu); Xaa at res. 58 = (Asn, Lys, Ala, Glu, 

30 Gly or Phe); Xaa at res. 59 <= (Pro, Ser or Val); Xaa at 
res. 60 - (Glu, Asp, Gly, Val or Lys); Xaa at res. 61 - 
(Thr, Ala, Val, Lys, Asp, Tyr, Ser, Ala, Pro or His); 
Xaa at res. 62 = (Val, Ala or He); Xaa at res. 63 - (Pro 
or Asp ) ; Xaa at res . 64 = ( Lys , Leu or Glu ) ; Xaa at 
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res. 65 ■ (Pro or Ala); Xaa at res. 68 - (Ala or Val); 
Xaa at res. 70 - (Thr f Ala or Glu) ; Xaa at res. 71 = 
(Gin, Lys, Arg or Glu)? Xaa at res. 72 - (Leu, Met or 
Val); Xaa at res. 73 - (Asn, Ser, Asp or Gly); Xaa at 
5 res. 74 « (Ala, Pro or Ser); Xaa at res. 75 = (He, Thr, 
Val or Leu); Xaa at res. 76 - (Ser, Ala or Pro); Xaa at 
res. 77 = (Val, Met or lie); Xaa at res. 79 - (Tyr or 
Phe); Xaa at res. 80 ■ (Phe, Tyr, Leu or His); xaa at 
res. 81 ■ (Asp, Asn or Leu); Xaa at res. 82 ■ (Asp, Glu, 

10 Asn or Ser); Xaa at res. 83 = (Ser, Gin, Asn, Tyr or 
Asp); Xaa at res. 84 - (Ser, Asn, Asp, Glu or Lys); Xaa 
at res. 85 = (Asn, Thr or Lys); Xaa at res. 87 = (He, 
Val or Asn); Xaa at res. 89 = (Lys or Arg); Xaa at 
res. 90 - (Lys, Asn, Gin, His or Val); Xaa at res. 91 - 

15 (Tyr or His); Xaa at res. 92 - (Arg, Gin, Glu or Pro); 
xaa at res. 93 ■ (Asn, Glu or Asp); Xaa at res. 95 - 
(Val, Thr, Ala or He); Xaa at res. 97 - (Arg, Lys, Val, 
Asp or Glu); Xaa at res. 98 - (Ala, Gly, Glu or Ser); 
Xaa at res. 100 = (Gly or Ala); and Xaa at res. 102 = 

20 (His or Arg) . 

Particularly useful sequences for use as morphogens 
in this invention include the C-terminal domains, e.g., 
the C-terminal 96-102 amino acid residues of Vgl, 

25 Vgr-1, DPP, OP-1, OP-2, CBMP-2A, CBMP-2B, GDF-1 (see 
Table II, below, and Seq. ID Nos. 5-14), as well as 
proteins comprising the C-terminal domains of 60A, 
BMP3, BMP 5 and BMP6 (see Seq. ID Nos. 24-28), all of 
which include at least the conserved six or seven 

30 cysteine skeleton. In addition, biosynthetic 
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constructs designed from the generic sequences , such as 
COP-1, 3-5/ 7 , 16 , disclosed in U.S. Pat. No. 
5/011,691, also are useful. Other sequences include 
the inhibins/activin proteins (see, for example, U.S. 
5 Pat. Nos. 4,968,590 and 5,011,691). Accordingly, other 
useful sequences are those sharing at least 70% amino 
acid sequence homology or "similarity", and preferably 
80% homology or similarity with any of the sequences 
above. These are anticipated to include allelic, 

10 species variants and other sequence variants (e.g., 
including "muteins" or "mutant proteins"), whether 
naturally-occurring or biosynthetically produced, as 
well as novel members of this morphogenic family of 
proteins. As used herein, "amino acid sequence 

15 homology" is understood to mean amino acid sequence 

similarity, and homologous sequences share identical or 
similar amino acids, where similar amino acids are 
conserved amino acids as defined by Dayoff et al., 
Atlas of Protein Sequence and Structure ; vol.5, 

20 Suppl.3, pp. 345-362 (M.O. Dayoff, ed., Nat'l BioMed. 
Research Fdn., Washington D.C. 1978.) Thus, a 
candidate sequence sharing 70% amino acid homology with 
a reference sequence requires that, following alignment 
of the candidate sequence with the reference sequence, 

25 70% of the amino acids in the candidate sequence are 
identical to the corresponding amino acid in the 
reference sequence, or constitute a conserved amino 
acid change thereto. "Amino acid sequence identity" is 
understood to require identical amino acids between two 

30 aligned sequences. Thus, a candidate sequence sharing 
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60% amino acid identity with a reference sequence 
requires that, following alignment of the candidate 
sequence with the reference sequence, 60% of the amino 
acids in the candidate sequence are identical to the 
5 corresponding amino acid in the reference sequence. 

As used herein, all homologies and identities 
calculated use OP-1 as the reference sequence. Also as 
used herein, sequences are aligned for homology and 

10 identity calculations using the method of Needleman et 
al. (1970) J.Mol. Biol. 48;443-453 and identities 
calculated by the Align program (DNAstar, Inc.) In all 
cases, internal gaps and amino acid insertions in the 
candidate sequence as aligned are ignored when making 

15 the homology/identity calculation. 

The currently most preferred protein sequences 
useful as morphogens in this invention include those 
having greater than 60% identity, preferably greater 

20 than 65% identity, with the amino acid sequence 

defining the conserved six cysteine skeleton of hOPl 
(e.g., residues 43-139 of Seq. ID No. 5). These most 
preferred sequences include both allelic and species 
variants of the OP-1 and OP-2 proteins, including the 

25 Drosophila 60A protein. Accordingly, in another 

preferred aspect of the invention, useful morphogens 
include active proteins comprising species of 
polypeptide chains having the generic amino acid 
sequence herein referred to as "OPX", which 

30 accommodates the homologies between the various 

identified species of OPl and OP2 (Seq. ID No. 29). 
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In still another preferred aspect of the invention, 
useful raorphogens include dimeric proteins comprising 
amino acid sequences encoded by nucleic acids that 
hybridize to DNA or RNA sequences encoding the C- 
5 terminal sequences defining the conserved seven 

cysteine domain of OP1 or OP2, e.g., nucleotides 1036- 
1341 and nucleotides 1390-1695 of Seq. ID No. 16 and 
20 , respectively, under stringent hybridization 
conditions. As used herein, stringent hybridization 
10 conditions are defined as hybridization in 40% 

formamide, 5 X SSPE, 5 X Denhardt's Solution, and 0.1% 
SDS at 37°C overnight, and washing in 0.1 X SSPE, 0.1% 
SDS at 50°C. 

15 The morphogens useful in the methods, composition 

and devices of this invention include proteins 
comprising any of the polypeptide chains described 
above, whether isolated from naturally-occurring 
sources, or produced by recombinant DNA or other 

20 synthetic techniques, and includes allelic and species 
variants of these proteins, naturally-occurring or 
biosynthetic mutants thereof, as well as various 
truncated and fusion constructs. Deletion or addition 
mutants also are envisioned to be active, including 

25 those which may alter the conserved C-terminal cysteine 
skeleton, provided that the alteration does not 
functionally disrupt the relationship of these 
cysteines in the folded structure. Accordingly, such 
active forms are considered the equivalent of the 

30 specifically described constructs disclosed herein. 
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The proteins may include forms having varying 
glycosylation patterns , varying N- termini , a family of 
related proteins having regions of amino acid sequence 
homology, and active truncated or mutated forms of 
5 native or biosynthetic proteins, produced by expression 
of recombinant DNA in host cells. 

The morphogenic proteins can be expressed from 
intact or truncated cDNA or from synthetic DNAs in 

10 procaryotic or eucaryotic host cells, and purified, 
cleaved, refolded, and dimerized to form 
morphogenically active compositions. Currently 
preferred host cells include coli or mammalian 
cells, such as CHO, COS or BSC cells. A detailed 

15 description of the morphogens useful in the methods, 

compositions and devices of this invention is disclosed 
in international application US92/01968 (WO 92/15323) 
the disclosure of which is incorporated herein by 
reference. 

20 

Thus, in view of this disclosure, skilled genetic 
engineers can isolate genes from cDNA or genomic 
libraries of various different species which encode 
appropriate amino acid sequences, or construct DNAs 

25 from oligonucleotides, and then can express them in 

various types of host cells, including both procaryotes 
and eucaryotes, to produce large quantities of active 
proteins useful as dietary compositions for enhancing 
tissue morphogenesis, including enhancing tissue 

30 development and tisssue viability in a variety of 
mammals, including humans. 
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The foregoing and other objects, features and 
advantages of the present invention will be made more 
apparent from the following detailed description of the 
invention. 
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Brief Description of the Drawings 

The foregoing and other objects and features of 
5 this invention, as well as the invention itself, may be 
more fully understood from the following description, 
when read together with the accompanying drawings, in 
which: 

10 FIG. 1A and B shows relative amounts of protein 

present in mammary gland extract eluate fractions of a 
C-18 reverse phase chromatography column (A), and the 
corresponding results of a Western Blot (B); 

15 

FIG. 2A and B shows relative amounts of protein 
present in bovine colostrum eluate fractions from 
purification scheme A of a C-18 reverse phase 
chromatography column (A), and the corresponding 
20 results of a Western blot under reduced (1) and 
oxidized (2) conditions (B); 

FIG. 3A and B shows relative amounts of protein 
present in bovine colostrum eluate fractions from 
25 purification scheme B of a C-18 reverse phase 
chromatography column (A), and the corresponding 
results of a Western Blot under reduced conditions (B); 

FIG. 4A and B shows relative amounts of protein 
30 present in bovine 57 day milk eluate fractions of a C- 
18 reverse phase chromatography column (A), and the 
corresponding results of a Western Blot under reduced 
(1) and oxidized (2) conditions (B); 
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FIG. 5 shows Western Blot analysis of bovine 
colostrum using OP-1 and BMP2-specif ic antibodies; 

FIG. 6A and B show results of in vivo and in vitro 
5 activity assays, respectively, for the corresponding 
fractions shown in Fig. 1; 

FIG. 7 is a photomicrograph of an immunoblot 
showing the presence of hOP-1 in serum; and 

10 

FIG. 8A is a dose response curve for the induction 
of the 180 kDa and 140 kDa N-CAM isoforms in morphogen- 
treated NG108-15 cells; 

15 FIG. 8B is a photomicrograph of a Western blot of 

whole cell extracts from morphogen-treated NG108-15 
cells with an N-CAM-specif ic antibody; and 

FIG. 9 (A and B) are photomicrographs showing the 
20 effect of morphogen-specif ic antibody on mouse 

development (9B) compared to untreated, control mice 
(9A). 
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Detailed Description of the Invention 

It now has been discovered that the proteins 
5 described herein are found in nursing mother's milk and 
are useful as components of a dietary composition for 
enhancing tissue morphogenesis in a mammal, 
particularly in an individual at risk for normal tissue 
development and vibility. As described herein, these 
10 proteins ("morphogens") are capable of enhancing tissue 
development in growing mammals, stimulating CAM 
expression and maintaining the normal tissue function 
in adult tissue. 

15 Provided below are detailed descriptions of 

suitable morphogens useful in the compositions and 
methods of this invention, as well as methods for their 
administration and application, and numerous, 
nonlimiting examples which demonstrate the suitability 

20 of the morphogens described herein as active components 
of a dietary composition for a mammal; and 2) provide 
assays with which to test candidate morphogens for 
their efficacy. Specifically, examples are provided 
which (1) demonstrate the presence of endogenous 

25 morphogen in milk and human serum (Examples 1 and 2), 
(2) demonstrate the ability of morphogens to induce CAM 
expression in a mammal (Example 3), (3) demonstrate the 
ability of morphogens to enhance tissue development in 
developing embryos (Example 4) and juveniles 

30 (Example 5); (4) demonstrate the ability of morphogens 
to reduce an osteoporotic condition in a mammal 
(Example 6); (5) demonstrate the presence of morphogens 
in developing tissues and adult stomach and gut tissue, 
demonstrate the ability of parenterally provided 
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raorphogen to localize to stomach tissue, and describe 
protocols for identifying raorphogen-synthesizing tissue 
(Example 7) and (6) describe protocols for obtaining 
morphogen-specific antibodies and measuring morphogens 
5 in solution (Example 8). 

I. Useful Morphogens 

As defined herein a protein is morphogenic if it is 

10 capable of inducing the developmental cascade of 
cellular and molecular events that culminate in the 
formation of new, organ-specific tissue and comprises 
at least the conserved C-terminal six cysteine skeleton 
or its functional equivalent (see supra). 

15 Specifically, the morphogens generally are capable of 
all of the following biological functions in a 
morphogenically permissive environment: stimulating 
proliferation of progenitor cells; stimulating the 
differentiation of progenitor cells; stimulating the 

20 proliferation of differentiated cells; and supporting 
the growth and maintenance of differentiated cells. 
Details of how the morphogens useful in the method of 
this invention first were identified, as well as a 
description on how to make, use and test them for 

25 morphogenic activity are disclosed in international 
application US92/01968 (WO 92/15323). As disclosed 
therein, the morphogens may be purified from naturally- 
sour ced material or recombinant ly produced from 
procaryotic or eucaryotic host cells, using the genetic 

30 sequences disclosed therein. Alternatively, novel 
morphogenic sequences may be identified following the 
procedures disclosed therein. 
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Particularly useful proteins include those which 
comprise the naturally derived sequences disclosed in 
Table II. Other useful sequences include biosynthetic 
constructs such as those disclosed in U.S. Pat. 
5 5,011,691, the disclosure of which is incorporated 
herein by reference (e.g., COP-1, COP-3, COP-4, COP-5, 
COP-7, and COP-16). 

Accordingly, the morphogens useful in the methods 
10 and compositions of this invention also may be 

described by morphogenically active proteins having 
amino acid sequences sharing 70% or, preferably, 80% 
homology (similarity) with any of the sequences 
described above, where "homology" is as defined herein 
15 above. 

The morphogens useful in the method of this 
invention also can be described by any of the 6 generic 
sequences described herein (Generic Sequences 1, 2, 3, 
20 4, 5 and 6). Generic sequences 1 and 2 also may 
include, at their N-terminus, the sequence 

Cys Xaa Xaa Xaa Xaa (Seq. ID No. 15) 
1 5 

25 

Table II, set forth below, compares the amino acid 
sequences of the active regions of native proteins that 
have been identified as morphogens, including human 
OP-1 (hOP-1, Seq. ID Nos. 5 and 16-17), mouse OP-1 
30 (mOP-1, Seq. ID Nos. 6 and 18-19), human and mouse OP-2 
(Seq. ID Nos. 7, 8, and 20-23), CBMP2A (Seq. ID No. 9), 
CBMP2B (Seq. ID No. 10), BMP 3 (Seq. ID No. 26), DPP 
(from Drosophila, Seq. ID No. 11), Vgl, (from Xenopus, 
Seq. ID No. 12), Vgr-1 (from mouse, Seq. ID No. 13), 



WO 94/03075 



PCT/US93/07190 



- 41 - 

GDF-1 (from mouse, Seg. ID Nos. 14, 32 and 33), 6 OA 
protein (from Drosophila, Seq. ID Nos. 24 and 25), BMP 5 
(Seg. ID No. 27) and BMP 6 (Seg. ID No. 28). The 
sequences are aligned essentially following the method 
5 of Needleman et al. (1970) J. Mol. Biol. , 48:443-453, 
calculated using the Align Program (DNAstar, Inc.) In 
the table, three dots indicates that the amino acid in 
that position is the same as the amino acid in hOP-1. 
Three dashes indicates that no amino acid is present in 

10 that position, and are included for purposes of 
illustrating homologies. For example, amino acid 
residue 60 of CBMP-2A and CBMP-2B is "missing". Of 
course, both these amino acid sequences in this region 
comprise As n- Ser (residues 58, 59), with CBMP-2A then 

15 comprising Lys and lie, whereas CBMP-2B comprises Ser 
and lie. 



20 



25 



h0P-l 

m0P-l 

h0P-2 

m0P-2 

DPP 

Vgl 

Vgr-1 

CBMP-2A 

CBHP-2B 



TABLE II 



Cys Lys Lys His Glu Leu Tyr Val 



i . • • • • 



Arg Arg 

Arg Arg 

Arg Arg ... Ser 

Lys Arg His 
••* Gly 

Arg • . ♦ Pro 

Arg Arg ... Ser 
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BMP3 


... Ala 


Arg 


Arg 


Tyr 


Lys 


GDF-1 


Arg 


Ala 


Arg 


Arg 




60A 


... Gin 


Het 


Glu 


Thr 


• * • • 


BHP5 




• • • 


• • • 






5 BMP6 


Arg 


• • • 


• • • 


• • • • • 


• • • * 



10 



15 



20 



hOP-l 

mOP-1 

hOP-2 

mOP-2 

DPP 

Vgl 

Vgr-1 

CBMP-2A 

CBMP-2B 

BMP 3 

GDF-1 

60A 

BMP5 

BMP 6 



Ser the Arg Asp Leu Gly Trp Gin Asp 



Ser 
Asp 
Glu 

• • • 

Asp 
Asp 
Asp 

• • • 
Asp 



Gin 

• • • 

Ser 
Lys 
Gin 
Ser 
Ser 
Ala 

• • • . 

Lys 

• • • 
Gin 



Glu 



Val 
Val 
Val 
Val 
Val 
He 
Val 



Leu 
Leu 
Asp 



Asn 



• • • • • • 

Asn ♦ ♦ ♦ 
Asn 

Ser Glu 

His Arg 
His 

• • • • • • 



10 



15 



25 



30 



hOP-l 

mOP-l 

hOP-2 

n0P-2 

DPP 

Vgl 

Vgr-1 

CBMP-2A 



Trp He He Ala Pro Glu Gly Tyr Ala 



i • • • • 



Val 
Val 

• • • 

Val 



Val 



Val 



Gin 
Gin 
Leu 
Gin 
Lys 
Pro 



Ser 
Ser 
Asp 
Met 

His 
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25 



CBMP-2B 




• • • 


Val 


... 




Pro 


... 


... 


Gin 


BHP3 


... 


• • • 


• • * 


Ser 


• '• • 


Lys 


Ser 


Phe 


Asp 


GDF-1 


• • • 


Val 




... 


... 


Arg 


... 


Phe 


Leu 


60A 


... 


• • • 


. • * 


... 


... 


... 


... 


... 


Gly 


BMP5 








• • ♦ 


... 


... 


... 


... 


... 


BMP 6 


• • « 


• • • 


• • « 


... 


... 


Lys 


... 


... 


... 








20 










25 




hOP-1 


Ala 


Tyr 




Cys 


Glu 


Gly 


Glu 


Cys 


Ala 


mOP-1 




. * » 


... 




... 


... 


... 




... 


hOP-2 




• • • 






. • • 


... 


... 




Ser 


mOP-2 




• • • 






... 


.... 


... 




. • • 


DPP 




• • • 






His 


... 


Lys 




Pro 


Vgl 


... 


Asn 


... 


... 


Tyr 


... 


... 




Pro 


Vgr-1 


• • * 


Asn 


... 


. . • 


Asp 


... 


... 




Ser 


CBMP-2A 




Fhe 






His 




Glu 




Pro 


CBMP-2B 


... 


Phe 


... 




His 


... 


Asp 




Pro 


BMP3 




• • • 


... 


... 


Ser 


... 


Ala 




Gin 


GDF-1 


» . • 


Asn 


... 


... 


Gin 


... 


Gin 




■ . • 


60A 


... 


Phe 


... 


... 


Ser 


... 


... 




Asn 


BKP5 


• • • 


Phe 


... 


... 


Asp 


... 


... 




Ser 


BMP6 


... 


Asn 


... 


... 


Asp 


... 


... 




Ser 










30 










35 


hOP-1 


Phe 


Pro 


Leu 


Asn 


Ser 


Tyr 


Met 


Asn 


Ala 


mOP-1 








... 




... 


... 






hOP-2 


• • • 






Asp 


... 


Cys 


... 




... 


mOP-2 


• * • 






Asp 


. • • 


Cys 






... 


DPP 


• . • 






Ala 


Asp 


His 


Phe 




Ser 


Vgl 


Tyr 






Thr 


Glu 


He 


Leu 




Gly 


Vgr-1 


• * • 






... 


Ala 


His 


... 




... 


CBHP-2A 


♦ • • 






Ala 


Asp 


His 


Leu 




Ser 


CBHP-2B 


♦ * • 






Ala 


Asp 


His 


Leu 




Ser 
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GDF-1 


Leu 


... 


Val 


Ala 


Leu 


Ser 


Gly 


Ser** 


... 


BHP3 


• • • 


... 


Met 


Pro 


Lys 


Ser 


Leu 


Lys 


Pro 


60A 


• • • 


... 


... 


... 


Ala 


His 


... 


... 


... 


BHF5 


• • • 


... 


... 


... 


Ala 


His 


Het 


... 


... 


BHP6 


• • • 


... 


... 


... 


Ala 


His 


Het 


... 


... 












40 










hOP-1 


Thr 


Asn 


His 


Ala 


lie 


Val 


Gin 


Thr 


Leu 


mOP-1 


* . * 




... 


... 


... 


... 




... 


... 


hOP-2 


• • • 


... 


... 


... 


... 


Leu 


... 


Ser 


... 


mOP-2 


• • • 


... 


... 


... 


... 


Leu 


... 


Ser 


... 


DPP 


# • * 


... 


... 


... 


Val 


... 


... 


... 




Vgl 


Ser 




... 


... 


... 


Leu 


... 




... 


Vgr-1 


. * • 


::: 






... 


... 


... 


... 


... 


CBMP-2A 


• • * 






... 


... 


... 


... 




... 


CBMP-2B 


• • • 


::: 


... 




... 


... 


... 


... 


... 


BHP3 


Ser 


... 


... 


... 


Thr 


He 


... 


Ser 


He 


GDF-1 


Leu 


... 


... 


... 


Val 


Leu 


Arg 


Ala 


... 


60A 




... 


... 


... 


... 


... 


... 


... 


... 


BHP5 


• • • 


... 


... 




... 


... 


... 


... 


... 


BMP6 








... 




... 


... 




... 




45 










50 








hOP-1 


Val 


His 


Phe 


He 


Asn 


Pro 


Glu 


Thr 


Val 


mOP-l 




... 


... 


... 


... 




Asp 


... 


... 


hOP-2 




His 


Leu 


Het 


Lys 




Asn 


Ala 


... 


mOP-2 




His 


Leu 


Het 


Lys 




Asp 


Val 


... 


DPP 




Asn 


Asn 


Asn 


... 




Gly 


Lys 




Vgl 




... 


Ser 


... 


Glu 




... 


Asp 


lie 


Vgr-1 




... 


Val 


Het 








Tyr 


... 


CBMP-2A 




Asn 


Ser 


Val 


... 


Ser 




Lys 


He 


CBHP-2B 




Asn 


Ser 


Val 


... 


Ser 




Ser 


lie 


BMP3 




Arg 


Ala** Gly 


Val 


Val 


Pro 


Gly • 


He 
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25 



GDF-1 


Met 


• • • 


Ala 


Ala 


Ala 


• * • 


Gly 


Ala 


Ala 


60A 


• • • 


• • • 


Leu 


Leu 


Glu 


• * • 


Lys 


Lys 


• • • 


BMP 5 


• • • 


• • • 


Leu 


Met 


Fhe 


• • • 


Asp 


His 


... 


BMP 6 


• • • 


• • • 


Leu 


Met 


• • • 


• • • 


• ♦ • 


Tyr 


■ . • 






55 










60 






hOP-1 


Pro 


Lys 


Pro 


Cys 


Cys 


Ala 


Pro 


Thr 


Gin 


mOP-1 




• • • 


• • » 




... 


• • » 


* • ♦ 


• • • 


... 


hOP-2 






Ala 






• « * 


... 


... 


Lys 


mOP-2 


* * • 


* • • 


Ala 






• • * 






Lys 


DPP 






Ala 






Val 






♦ • • 


Vgl 


• • • 


Leu 


* * • 


... 


... 


Val 


... 


... 


Lys 


Vgr-1 


• • • 


• • • 


• • • 


• • * 


* ♦ • 


... 


* . • 


• • . 


Lys 


CBHP-2A 




• • • 


Ala 






Val 






Glu 


CBHP-2B 


• • • 


• • • 


Ala 


... 


• « • 


Val 


... 


• . • 


Glu 


BMP3 


• • • 


Glu 


• • • 


• • • 


... 


Val 


• • . 


Glu 


Lys 


GDF-1 


Asp 


Leu 


• • » 


• • • 


• * ♦ 


Val 


... 


Ala 


Arg 


60A 




• • • 


• • • 


... 


• . . 


• • • 




... 


Arg 


BMP5 


• • * 


* . • 


• • • 


• • • 


... 


• . • 






Lys 


BMP6 




• • • 


• • • 


• • • 


• • • 


• « • 


. . • 


• • • 


Lys 








65 










70 




hOP-1 


Leu 


Asn 


Ala 


He 


Ser 


Val 


Leu 


Tyr 


Phe 


mOP-1 








• • • 




• • • 




• • • 




hOP-2 


• • • 


Ser 


• • * 


Thr 


• » • 






• • • 


Tyr 


mOP-2 


• • • 


Ser 


• • • 


Thr 


• * * 


• • • . 




• • • 


Tyr 


Vgl 


Met 


Ser 


Pro 




* • . 


Met 




Pbe 


Tyr 


Vgr-1 


Val 


• • » 


• • • 


• * • 


• • • 


* • » 




* • • 


• • • 


DPP 


• * • 


Asp 


Ser 


Val 


Ala 


Met 






Leu 


CBMP-2A 


• • • 


Ser 


• • • 


* • • 




Met 




• • * 


Leu 


CBHP-2B 


• • • 


Ser 


• * • 


• • • 


... 


Met 




• • » 


Leu 


BMP3 


Met 


Ser 


Ser 


Leu 




He 




Fhe 


Tyr 


GDF-1 


• • • 


Ser 


Pro 


• « • 


m • • 


... 




Fhe 


... 
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60A 

BMP5 

BMP6 



Gly 



••• • • • 



Leu Pro 



75 



> * • • • t 



His 



80 



10 



15 



hOP-1 

mOP-1 

hOP-2 

mOP-2 

DPP 

Vgl 

Vgr-1 

CBHP-2A 

CBMP-2B 

BMP 3 

GDF-1 

60A 

BMP5 

BMP 6 



Asp Asp Ser Ser Asn Val lie Leu Lys 



Asn 



Leu 



Ser 
Ser 

• • • 

Asn 

Glu 
Glu 
Glu 
Asn 
Asn 



Gin 
Asn 
Asn 
Asn 
Tjrr 
Asn 

• • • 
Asp 

• • • 
Asn 



Asn 
Asn 
« * • 
Asp 

Glu 
Asp 
Lys 
Asp 
Glu 



Thr 



Lys 
Lys 



Val 
Val 
• * • 
Val 
Val 
Val 
Val 
Asn 



Arg 
Arg 

* • • 

Arg 



Arg 



20 



85 



25 



30 



hOP-1 

mOP-1 

hOP-2 

mOP-2 

DPP 

Vgl 

Vgr-1 

CBHP-2A 

CBHP-2B 



Lys Tyr Arg Asn Met Val Val Arg 



Asn 
His 

• • • 

Asn 
Asn 



His 
His 



• • ■ • • • 

• • • • • • 

Gin Glu 
Glu 

• • • • • • 

Gin Asp 

Gin Glu 



Thr 
Ala 



Lys 
Lys 
Val 
Asp 
* • * 
Glu 
Glu 
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BMP 3 


Val 


Pro 


♦ • • mr 






GDF-1 


Gin 


Glu Asp 


... ••• 


A cn 

A&P 




60A 


• • • • • • 


... ... 


lie 


T TTC 

i*ys 




BMP 5 


• • • * • • 


• • . ... 


• * • ... 


• . . 


5 


BMP 6 


* • • • • • 


Trp 


••• ••• 


• » • 






•90 




AC 






hOP-l 


Ala Cys 


Gly Cys 


His 




10 


mOP-1 


• • • • . • 


• • • . . . 


• • « 






hOP-2 


• • • • • • 


• • • ... 


• • * 






mOP-2 


« • • • • • 


• « . . » • 


• . . 






DPP 


Gly 


• • • ... 


A. — _ 

Arg 






Vgl 


Glu 


• • * . . . 


Arg 




15 


Vgr-1 


• « • • • * 


• . . ... 


... 






CBMP-2A 


Gly 


• • * . . • 


Arg 






CBHP-2B 


Gly ... 
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Arg 






BMP 3 


Set 


Ala 


Arg 






GDF-1 


Glu 


... . . . 


Arg 
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60A 


Ser 


. • . ... 


... 






BMP5 


Ser 


• . . . . • 


... 






BMP 6 


.... ... 


• . • ... 


... 










100 








♦★Between 


residues 56 and 57 of BHP3 


is a Val residue; 




25 




between residues 43 and 


44 of GDF-1 lies 





the amino acid sequence Gly-Gly-Pro-Pro. 



As is apparent from the foregoing amino acid 
30 sequence comparisons, significant amino acid changes 
can be made within the generic sequences while 
retaining the morphogenic activity. For example, while 



WO 94/03075 * PCT/US93/07190 



the GDF-1 protein sequence depicted in Table II shares 
only about 50% amino acid identity with the hOPl 
sequence described therein, the GDF-1 sequence shares 
greater than 70% amino acid sequence homology (or 
5 "similarity") with the hOPl sequence, where "homology" 

or "similarity" includes allowed conservative amino i 
acid changes within the sequence as defined by Dayoff , 
et al., Atlas of Protein Sequence and Structure vol.5, 
supp.3, pp. 345-362, (M.O. Dayoff, ed., Nat'l BioMed. 
10 Res. Fd'n, Washington D.C. 1979.) 

The currently most preferred protein sequences [ 
useful as morphogens in this invention include those 
having greater than 60% identity, preferably greater 

15 than 65% identity, with the amino acid sequence 

defining the conserved six cysteine skeleton of hOPl 

(e.g., residues 43-139 of Seq. ID No. 5). These most 

preferred sequences include both allelic and species [ 

variants of the OP-1 and OP-2 proteins, including the 

20 Drosophila 60A protein. Accordingly, in still another 
preferred aspect, the invention includes morphogens 
comprising species of polypeptide chains having the 
generic amino acid sequence referred to herein as 
"OPX", which defines the seven cysteine skeleton and [ 

25 accommodates the identities between the various 

identified mouse and human OP1 and OP2 proteins. OPX 
is presented in Seq. ID No. 29. As described therein, 
each Xaa at a given position independently is selected 
from the residues occurring at the corresponding 

30 position in the C-terminal sequence of mouse or human [ 
OP1 or OP2 (see Seq. ID Nos. 5-8 and/or Seq. ID Nos. 
16-23). 
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II. Formulations and Methods for Administering 
Therapeutic Agents 

A. General Considerations 

5 

The morphogens may be provided to an individual by 
any suitable means, most preferably orally , or, 
alternatively, parenterally. Where the morphogen is to 
be provided parenterally, such as intravenously or by 

10 enteral feeding tube, the morphogen preferably 

comprises part of an aqueous solution. The solution is 
physiologically acceptable so that in addition to 
delivery of the desired morphogen to the patient, the 
solution does not otherwise adversely affect the 

15 patient's electrolyte and volume balance. The aqueous 
medium for the morphogen thus may comprise normal 
physiologic saline (0.85% NaCl, 0.15M), pH 7-7.4. The 
aqueous solution containing the morphogen can be made, 
for example, by dissolving the protein in 50% ethanol 

20 containing acetonitrile in 0*1% trif luoroacetic acid 
(TFA) or 0.1% HC1, or equivalent solvents. One volume 
of the resultant solution then is added, for example, 
to ten volumes of phosphate buffered saline (PBS), 
which further may include 0.1-0.2% human serum albumin 

25 (HSA). The resultant solution preferably is vortexed 
extensively. If desired, a given morphogen may be made 
more soluble by association with a suitable molecule. 
For example, the pro form of the morphogenic protein 
comprises a species that is soluble in physiologically 

30 buffered solutions. In fact, the endogenous protein is 
thought to be transported (e.g., secreted and 
circulated) in this form. This soluble form of the 
protein may be obtained from the culture medium of 
morphogen-secreting mammalian cells. Alternatively, a 
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soluble species may be formulated by complexing (e.g., 
via non-covalent interaction) the mature dimer (or an 
active fragment thereof) with part or all of one or, 
preferably, two pro domain peptides (see Section A.l, 
5 below). Another molecule capable of enhancing 
solubility and particularly useful for oral 
administrations, is casein, including derivatives and 
analogs thereof. For example, addition of 0.2% casein 
increases solubility of the mature active form of OP-1 
10 in physiologically buffered solutions by 80%. Other 
components found in milk and/or various serum proteins 
also may be useful. 

Useful solutions for parenteral administration may 
be prepared by any of the methods well known in the 
pharmaceutical art, described, for example, in 
Remington's Pharmaceutical Sciences (Gennaro, A., ed.), 
Mack Pub., 1990. Formulations may include, for 
example, polyalkylene glycols such as polyethylene 
glycol, oils of vegetable origin, hydrogenated 
naphthalenes, and the like. Biocompatible, preferably 
bioresorbable, polymers, including, for example, 
hyaluronic acid, collagen, polybutyrate, tricalcium 
phosphate, lactide and lactide/glycolide copolymers, 
may be useful excipients to control the release of the 
morphogen in vivo . 

As described above, the dietary supplements 
comprising the morphogens described herein preferably 
30 are provided orally. Oral administration of proteins 
as therapeutics generally is not practiced as most 
proteins are readily degraded by digestive enzymes and 
acids in the mammalian digestive system before they can 
be absorbed into the bloodstream. However, the 



15 



20 



25 
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morphogens described herein typically are acid stable 
and protease-resistant (see, for example, U.S. Pat. No. 
4,968,590.) In addition, at least one morphogen, OP-1, 
has been identified in mammary gland extract, colostrum 
5 and 57-day milk (see Example 1, below). Moreover, the 
OP-1 purified from mammary gland extract is 
morphogenically active. Specifically, this protein 
induces endochondral bone formation in mammals when 
implanted subcutaneously in association with a suitable 

10 matrix material, using a standard in vivo bone assay, 
such as is disclosed in U.S. Pat. No. 4,968,590. 
Moreover, the morphogen also is detected in the 
bloodstream (see Example 2, below). These findings 
indicate that oral and parenteral administration are 

15 viable means for administering morphogens to an 

individual. In addition, while the mature forms of 
certain morphogens described herein typically are 
sparingly soluble, the morphogen form found in milk 
(and mammary gland extract and colostrum) is readily 

20 soluble, probably by association of the mature, 

morphogenically active form with part or all of at 
least one pro domain peptide and/or by association with 
one or more milk components. Accordingly, the 
compounds provided herein also may be associated with 

25 molecules capable of enhancing their solubility in 
vitro or in vivo . 

The dietary compositions for oral administration 
may be formulated as a liquid, for example, as part of 
30 an aqueous medium as described above for parenteral 

administration, and which further may contain flavoring 
and coloring agents. The formulation also may be 
combined with a beverage or may be provided in a 
syrup. The dietary composition also may be provided as 



* 
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an aerosol for oral or nasal administration. 
Formulations for inhalation administration contain as 
excipients, for example, lactose, or may be aqueous 
solutions containing, for example, polyoxyethylene-9- 
5 lauryl ether, glycocholate and deoxycholate, or oily 
solutions for administration in the form of nasal 
drops, or as a gel to be applied intranasally. 
Alternatively, the dietary composition may be provided 
as a solid, for example as a tablet, capsule or 
10 lozenge. As for parenteral administration, 

formulations for oral administration also may include 
molecules to enhance a controlled release of the 
morphogen in vivo * 

15 As will be appreciated by those skilled in the art, 

the concentration of the compounds described in a given 
dietary supplement composition will vary depending upon 
a number of factors, including the dosage number to be 
administered, the chemical characteristics (e.g., 

20 hydrophobicity) of the compounds employed, and the 
route of administration. The preferred dosage to be 
administered also is likely to depend on such variables 
as the type and extent of tissue development 
enhancement desired, the type and extent of any tissue 

25 damage present to be repaired, the overall health 
status of the particular individual, the relative 
biological efficacy of the compound selected, the 
formulation of the compound excipients, and its route 
of administration. In general terms, the compounds of 

30 this invention may be provided in a formulation 
containing about 0.001 to 10% w/v of morphogen to 
formulation. Typical dose ranges are from about 10 
ng/kg to about 1 g/kg of body weight per day; a 
preferred dose range is from about 0.1 pg/kg to 
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100 mg/kg of body weight per day. Optimally, the 
morphogen dosage given is between 0.1-100 of protein 
per kilogram weight of the individual. No obvious 
morphogen induced pathological lesions are induced when 
5 mature morphogen (e.g./ OP-1, 20 fjg) is administered 
daily to normal growing rats for 21 consecutive days. 
Moreover, 10 pg systemic injections of morphogen (e.g., 
OP-1) injected daily for 10 days into normal newborn 
mice does not produce any gross abnormalities. 

10 

In administering morphogens parenterally in the 
methods of the present invention, preferably a large 
volume loading dose is used at the start of the 
treatment. The treatment then is continued with a 
15 maintenance dose. In all cases administration dosages 
then can be monitored by measuring at intervals the 
levels of the morphogen in the blood. 

A. 1 Soluble Morphogen Complexes 

20 

A currently preferred form of the morphogen useful 
in therapeutic formulations, having improved solubility 
in aqueous solutions and consisting essentially of. 
amino acids, is a dimeric morphogenic protein 

25 comprising at least the 100 amino acid peptide sequence 
having the pattern of seven or more cysteine residues 
characteristic of the morphogen family complexed with a 
peptide comprising part or all of a pro region of a 
member of the morphogen family, or an allelic, species 

30 or other sequence variant thereof. Preferably, the 
dimeric morphogenic protein is complexed with two 
peptides. Also, the dimeric morphogenic protein 
preferably is noncovalently complexed with the pro . 
region peptide or peptides. The pro region peptides 
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also preferably comprise at least the N-terminal 
eighteen amino acids that define a given morphogen 
pro region. In a most preferred embodiment/ peptides 
defining substantially the full length pro region are 
5 used. 

Other soluble forms of morphogens include dimers of 
the uncleaved pro forms of these proteins, as well as 
"hemi-dimers" wherein one subunit of the dimer is an 
10 uncleaved pro form of the protein, and the other 
subunit comprises the mature form of the protein, 
including truncated forms thereof, preferably 
noncovalently associated with a cleaved pro domain 
peptide. 

15 

As described above, useful pro domains include the 
full length pro regions, as well as various truncated 
forms hereof, particularly truncated forms cleaved at 
proteolytic Arg-Xaa-Xaa-Arg cleavage sites. For 

20 example, in OP-1, possible pro sequences include 
sequences defined by residues 30-292 (full length 
form); 48-292; and 158-292. Soluble OP-1 complex 
stability is enhanced when the pro region comprises the 
full length form rather than a truncated form, such as 

25 the 48-292 truncated form, in that residues 30-47 show 
sequence homology to the N-terminal portions of other 
morphogens, and are believed to have particular utility 
in enhancing complex stability for all morphogens. 
Accordingly, currently preferred pro sequences are 

30 those encoding the full length form of the pro region 
for a given morphogen. Other pro sequences 
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contemplated to have utility include biosynthetic pro 
sequences , particularly those that incorporate a 
sequence derived from the N-terminal portion of one or 
more morphogen pro sequences. 

5 

As will be appreciated by those having ordinary 
skill in the art, useful sequences encoding the pro 
region may be obtained from genetic sequences encoding 
known morphogens. Alternatively, chimeric pro regions 
10 can be constructed from the sequences of one or more 
known morphogens . Still another option is to create a 
synthetic sequence variant of one or more known pro 
region sequences* 

15 In another preferred aspect, useful pro region 

peptides include polypeptide chains comprising an amino 
acid sequence encoded by a nucleic acid that hybridizes 
under stringent conditions with a DNA or RNA sequence 
encoding at least the N-terminal eighteen amino acids 

20 of the pro region sequence for OPl or OP2, e.g., 

nucleotides 136-192 and 152-211 of Seq. ID No. 16 and 
20, respectively. 

A.1A Isolation of Soluble morphogen complex from 
25 conditioned media or body fluid 

Morphogens are expressed from mammalian cells as 
soluble complexes. Typically, however the complex is 
disassociated during purification, generally by 
30 exposure to denaturants often added to the purification 
solutions, such as detergents, alcohols, organic 
solvents, chaotropic agents and compounds added to 
reduce the pH of the solution. Provided below is a 
currently preferred protocol for purifying the soluble 
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proteins from conditioned media (or, optionally, a body 
fluid such as serum, cerebro- spinal or peritoneal 
fluid), under non-denaturing conditions. The method is 
rapid, reproducible and yields isolated soluble 
5 morphogen complexes in substantially pure form. 

Soluble morphogen complexes can be isolated from 
conditioned media using a simple, three step 
chromatographic protocol performed in the absence of 

10 denaturants. The protocol involves running the media 
(or body fluid) over an affinity column, followed by 
ion exchange and gel filtration chromatographies. The 
affinity column described below is a Zn-IMAC column. 
The present protocol has general applicability to the 

15 purification of a variety of morphogens, all of which 
are anticipated to be isolatable using only minor 
modifications of the protocol described below. An 
alternative protocol also envisioned to have utility an 
immunoaf f inity column, created using standard 

20 procedures and, for example, using antibody specific 
for a given morphogen pro domain (complexed, for 
example, to a protein A-conjugated Sepharose column. ) 
Protocols for developing immunoaf f inity columns are 
well described in the art, (see, for example, Guide to 

25 Protein Purification , M. Deutscher, ed., Academic 

Press, San Diego, 1990, particularly sections VII and 
XI.) 

In this experiment OP-1 was expressed in mammalian 
30 CHO (Chinese hamster ovary) cells as described in the 
art (see, for example, international application 
US90/05903 (WO91/05802) . ) The CHO cell conditioned 
media containing 0.5% FBS was initially purified using 
Immobilized Metal-Ion Affinity Chromatography (IMAC). 
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The soluble OP-1 complex from conditioned media binds 
very selectively to the Zn-IMAC resin and a high 
concentration of imidazole (50 mM imidazole, pH 8.0) is 
required for the effective elution of the bound 
5 complex. The Zn-IMAC step separates the soluble OP-1 
from the bulk of the contaminating serum proteins that 
elute in the flow through and 35 mM imidazole wash 
fractions. The Zn-IMAC purified soluble OP-1 is next 
applied to an S-Sepharose cation-exchange column 

10 equilibrated in 20 mM NaP0 4 (pH 7.0) with 50 mM NaCl. 
This S-Sepharose step serves to further purify and 
concentrate the soluble OP-1 complex in preparation for 
the following gel filtration step. The protein was 
applied to a Sephacryl S-200HR column equilibrated in 

15 TBS. Using substantially the same protocol , soluble 
morphogens also may be isolated from one or more body 
fluids , including serum, cerebro-spinal fluid or 
peritoneal fluid. 

20 IMAC was performed using Chelating-Sepharose 

(Pharmacia) that had been charged with three column 
volumes of 0.2 M ZnSO^. The conditioned media was 
titrated to pH 7.0 and applied directly to the ZN-IMAC 
resin equilibrated in 20 mM HEPES (pH 7.0) with 500 mM 

25 NaCl. The Zn-IMAC resin was loaded with 80 inL of 
starting conditioned media per mL of resin. After 
loading, the column was washed with equilibration 
buffer and most of the contaminating proteins were 
eluted with 35 mM imidazole (pH 7.0) in equilibration 

30 buffer. The soluble OP-1 complex then is eluted with 
50 mM imidazole (pH 8.0) in 20 mM HEPES and 500 mM 
NaCl- 
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The 50 mM imidazole eluate containing the soluble 
OP-1 complex was diluted with nine volumes of 20 mM 
NaP0 4 (pH 7.0) and applied to an S-Sepharose 
(Pharmacia) column equilibrated in 20 mM NaP0 4 (pH 7.0) 
5 with 50 mM NaCl. The S-Sepharose resin was loaded with 
an equivalent of 800 mL of starting conditioned media 
per mL of resin. After loading the S-Sepharose column 
was washed with equilibration buffer and eluted with 
100 mM NaCl followed by 300 mM and 500 mM NaCl in 20 mM 

10 NaP0 4 (pH 7.0). The 300 mM NaCl pool was further 
purified using gel filtration chromatography. Fifty 
mis of the 300 mm NaCl eluate was applied to a 5.0 X 90 
cm Sephacryl S-200HR (Pharmacia) equilibrated in Tris 
buffered saline (TBS) , 50 mM Tris, 150 mM NaCl 

15 (pH 7.4). The column was eluted at a flow rate of 5 
mL/minute collecting 10 mL fractions. The apparent 
molecular of the soluble OP-1 was determined by 
comparison to protein molecular weight standards 
(alcohol dehydrogenase (ADH, 150 kDa), bovine serum 

20 albumin (BSA, 68 kDa), carbonic anhydrase (CA, 30 kDa) 
and cytochrome C (cyt C, 12.5 kDa). The purity of the 
S-200 column fractions was determined by separation on 
standard 15% polyacrylamide SDS gels stained with 
coomassie blue. The identity of the mature OP-1 and 

25 the pro-domain was determined by N- terminal sequence 
analysis after separation of the mature OP-1 from the 
pro-domain using standard reverse phase C18 HPLC. 



The soluble OP-1 complex 
30 molecular weight of 110 kDa. 
predicted composition of the 
one mature OP-1 dimer (35-36 



elutes with an apparent 

This agrees well with the 
soluble OP-1 complex with 
kDa) associated with two 
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pro-domains (39 kDa each). Purity of the final complex 
can be verified by running the appropriate fraction in 
a reduced 15% polyacrylamide gel. 

5 The complex components can be verified by running 

the complex-containing fraction from the S-200 or S- 
200HR columns over a reverse phase C18 HPLC column and 
eluting in an acetonitrile gradient (in 0.1% TFA), 
using standard procedures. The complex is dissociated 

10 by this step, and the pro domain and mature species 

elute as separate species. These separate species then 
can be subjected to N-terminal sequencing using 
standard procedures (see, for example, Guide to 
Protein Purification , M. Deutscher, ed., Academic 

15 Press, San Diego, 1990, particularly pp. 602-613), and 
the identity of the isolated 36kD, 39kDa proteins 
confirmed as mature morphogen and isolated, cleaved pro 
domain, respectively. N-terminal sequencing of the 
isolated pro domain from mammalian cell produced OP-1 

20 revealed 2 forms of the pro region, the intact form 
(beginning at residue 30 of Seq. ID No. 16) and a 
truncated form, (beginning at residue 48 of Seq* ID No. 
16.) N-terminal sequencing of the polypeptide subunit 
of the isolated mature species reveals a range of 

25 N- termini for the mature sequence, beginning at 

residues 293, 300, 313, 315, 316, and 318, of Seq. ID 
No. 16, all of which are active as demonstrated by the 
standard bone induction assay. 
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A. IB. In Vitro Soluble Morphogen Complex Fo rmation 

As an alternative to purifying soluble complexes 
from culture media or a body fluid, soluble complexes 
5 may be formulated from purified pro domains and mature 
dimeric species. Successful complex formation 
apparently requires association of the components under 
denaturing conditions sufficient to relax the folded 
structure of these molecules, without affecting 

10 disulfide bonds. Preferably, the denaturing conditions 
mimic the environment of an intracellular vesicle 
sufficiently such that the cleaved pro domain has an 
opportunity to associate with the mature dimeric 
species under relaxed folding conditions. The 

15 concentration of denaturant in the solution then is 

decreased in a controlled, preferably step-wise manner, 
so as to allow proper refolding of the dimer and pro 
regions while maintaining the association of the pro 
domain with the dimer. Useful denaturants include 4-6M 

20 urea or guanidine hydrochloride (GuHCl), in buffered 
solutions of pH 4-10, preferably pH 6-8. The soluble 
complex then is formed by controlled dialysis or 
dilution into a solution having a final denaturant 
concentration of less than 0.1-2M urea or GuHCl, 

25 preferably 1-2 M urea of GuHCl, which then preferably 
can be diluted into a physiological buffer. Protein 
pur if ication/renaturing procedures and considerations 
are well described in the art, and details for 
developing a suitable renaturing protocol readily can 

30 be determined by one having ordinary skill in the art. 
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One useful text one the subject is Guide to Protein 
Purification , M. Deutscher f ed., Academic Press, San 
Diego f 1990, particularly section V. Complex formation 
also may be aided by addition of one or more chaperone 
5 proteins . 

A.1C. Stability of Soluble Morphoaen Complexes 

The stability of the highly purified soluble 

10 morphogen complex in a physiological buffer, e.g., 
tris-buffered saline (TBS) and phosphate-buffered 
saline (PBS), can be enhanced by any of a number of 
means. Currently preferred is by means of a pro region 
that comprises at least the first 18 amino acids of the 

15 pro sequence (e.g., residues 30-47 of Seq. ID NO. 16 
for OP-1), and preferably is the full length pro 
region. Residues 30-47 show sequence homology to the 
N- terminal portion of other morphogens and are believed 
to have particular utility in enhancing complex 

20 stability for all morphogens. Other useful means for 
enhancing the stability of soluble morphogen complexes 
include three classes of additives. These additives 
include basic amino acids (e.g., L-arginine, lysine and 
betaine); nonionic detergents (e.g., Tween 80 or 

25 Nonldet P-120); and carrier proteins (e.g., serum 
albumin and casein). Useful concentrations of these 
additives include 1-100 mM, preferably 10-70 mM, 
including 50 mM, basic amino acid;, 0.01-1.0%, 
preferably 0.05-0.2%, including 0.1% (v/v) nonionic 

30 detergent;, and 0.01-1.0%, preferably 0.05-0.2%, 
including 0.1% (w/v) carrier protein. 
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B. Considerations for Infant and Other Formulas 
1. Infant Formulas 

5 In all cases the morphogens of this invention 

preferably are added to an infant formula that complies 
with the nutritional guidelines provided by the AAP and 
ESPGAN. Basic ingredients for infant formulas include 
cow's milk, protein, whey proteins, casein and its 

10 salts (i.e. calcium caseinate). Soy protein isolates 
may be substituted for milk-derived proteins, and 
preferably are used in the products made for infants 
with lactose intolerance and/or cow's protein 
intolerance. Protein hydrolyzates (i.e. casein and 

15 lactalbumin hydrolyzates) with low molecular weight, 
also may be used for these products. 

The proportions of the diverse component nutrients 
preferably are similar to those of human milk. Thus, 

20 the ratio of whey proteins to casein preferably varies 
from 60:40 to 70:30 in infant formulas based on milk. 
The mixture of fats employed is made up of edible fats 
to provide an essential fatty acid profile. Lactose 
preferably is used as the carbohydrate source for 

25 at- term newborns infants, and dextrinmaltose preferably 
is employed in products used for the treatment of 
lactose intolerance and malabsorption syndromes in 
infancy . 

30 Infant formulas according to the invention also 

preferably contain minerals (including calcium, 
phosphorus, sodium, potassium, chloride, magnesium, 
iron, zinc, copper, manganese and iodine) and vitamins 
(including vitamin A, vitamin D3, vitamin C, vitamin 
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Bl, vitamin B2, vitamin B6, vitamin B12, pantothenic 
acid, vitamin E, vitamin Kl, folic acid, biotin) 
adequate for the infants' requirements. Also, in the 
products whose source of proteins is derived from soy 
5 or protein isolates or hydrolyzates , carnitine 
preferably is included to satisfy the nutritional 
requirements for this compound in infants with * 
malabsorptive syndromes. 

10 A typical ready-to- feed morphogen-enriched 

formulation for infants, when diluted to feeding 
concentrations, preferably comprises in addition to the 
added morphogen, from about 1-5% by weight fat, from 
about 0.01 to about 0.5% by weight immunoglobulins as 

15 appropriate, from about 4-10% by weight carbohydrate in 
a quantity substantially to mimic the carbohydrate 
content of human mother's milk, from about 0.5 to 4% by 
weight protein in a quantity substantially to mimic the 
protein content of human mother's milk, optional 

20 vitamins and minerals as required, a total solids 
content of from about 8 to 17% by weight, and the 
remainder water. 

A typical protein source for use in infant formula 
25 is electrodialyzed whey or electrodialyzed skim milk or 
milk whey, although other protein sources are also 
available and may be used. Preferred sugars include 
food grade substances such as glucose, dextrose, 
sucrose, or edible lactose. The following vitamins and 
30 minerals may also be incorporated in the infant 
formula: calcium, phosphorus, potassium, sodium, 
chloride, magnesium, manganese, iron, copper, zinc, 
selenium, iodine, and vitamins A, E, D, and B complex. 
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These micronutrients are added in the form of commonly 
accepted nutritional compounds in amounts equivalent to 
those present in human milk on a per calories basis. 

5 The infant formula according to the present 

invention also preferably is sterilized and 
subsequently used on a ready-to-feed basis, or can be 
stored as a concentrate ♦ The concentrate can be 
prepared using standard procedures known in the art, 

10 and the formula can be reconstituted by rehydrating the 
concentrate* The infant formula preferably is a stable 
liquid and has a suitable shelf life, A more detailed 
description of infant formula considerations, including 
preferred formulations for newborn, preterm and low 

15 birth-weight infants, lactose- intolerant infants, may 
be found, for example, in US Pat. No. 5,066,500 to Gil 
et al., the disclosure of which is incorporated herein 
by reference. 

20 2. Other Nutritional Products 

The raorphogen-enriched dietary products for 
balanced nutrition (e.g., dietary food formulations) 
according to the present invention, preferably have, in 

25 addition to added morphogen, a composition of nutrients 
adequate to the specific requirements of not only 
healthy human in need of a balanced nutritional 
product, but also those individuals at risk for lost or 
reduced tissue function due malnutrition-maladsorption 

30 disorder, and/or altered metabolism. Individuals 

particularly affected by an altered metabolic function 
include postmenopausal women or aged individuals, 
hypercatabolic individuals, and individuals undergoing 
periods of rapid growth or physical stress, such as 
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developing juveniles, and pregnant, lactating and 
nursing mothers. Other individuals at risk are those 
suffering from malnutrition, induced, for example, by 
starvation and/or an eating disorder, and individuals 
5 affected with energy-protein malnutrition and in 
hypercatablic states derived from traumatic, septic, 
surgical processes and other clinically-derived 
malabsorption syndromes* 

10 Morphogen-enriched nutritional products according 

to the present invention preferably also provide 
mineral elements which include trace elements and 
vitamins in adequate proportions to satisfy the 
specific requirements of normal healthy individuals as 

15 well as individuals at risk, such as those suffering 
malabsorption-malnutrition processes and in a 
hypercatabolic state. The nutritional products also 
preferably are enriched with amino acids sources, 
vitamins, nucleosides and/or nucleotides in similar 

20 amounts to those present in ordinary foods. 

As described above for infant formulas, liquid 
products may be formulated ready for consumption or as 
concentrates to be diluted before use. Preferably, 
25 liquid dietary compositions have pH values generally 
ranging from about 6.0 to about 8.0, most preferbly 
6.8-7.5. 
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Useful dietary compositions and considerations for 
their formulation are well described in the medical and 
nutritional arts. Useful compositions for clinical 
nutrition, also are described in detail in US 
5 Pat. No. 5,066,500. 

III. Examples 

10 Example 1. Determination of the Presen ce of Morphogen 

in Milk 

Morphogenically active OP-1 was demonstrated to be 
present in mammary gland extract, colostrum, and milk, 

15 as described below. The discovery that the morphogen 
naturally is present in milk, together with the known 
observation that mature, active OP-1 is acid-stable and 
protease-resistant, indicate that oral administration 
is a useful route for therapeutic administration of 

20 morphogen to a mammal. Oral administration typically 
is the preferred mode of delivery for extended or 
prophylactic therapies. In addition, the 
identification of morphogen in all milk forms, 
including colostrum, indicates that the protein plays a 

25 significant role in tissue development, including 
skeletal development of juveniles. 

Rat mammary gland extract and bovine colostrum and 
57 day milk were subjected to purification procedures 
30 designed to partially purify OP-1. The partially 

purified product then was examined for the presence of 
OP-1 by Western blot analysis using OP-l-specif ic 
antisera, and tested for in vivo and in vitro activity. 
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1.1 Purification 

The purification protocol for all three "milk" 
forms (e.g., mammary gland extract, colostrum and 57- 
5 day milk), involved three chromatography steps: (1) 
cation-exchange chromatography (S-Sepharose and 
followed by Phenyl-Sepharose chromatography); (2) 
Copper-Immobilized Metal Affinity chromatography 
(Cu++-IMAC); and finally, (3) C-18 reverse phase 

10 chromatography. Fractions were sampled at each step 
for the presence of OP-1. Fraction samples for testing 
were dialyzed versus water/0.1% TFA, then against 30% 
acetonitrile/0.1% TFA for analysis on SDS- 
polyacrylamide gels and immunoblots, using standard 

15 methodologies well described in the art. Unless 
otherwise stated, the primary antibody used for the 
immunoblots was made against full length OP-1 produced 
in E.coli using standard recombinant DNA and antibody 
production techniques (see, for example, Example 8, 

20 below for a general description for producing 

morphogen-specif ic antibodies.) Fractions found to 
contain the morphogen then were applied to the next 
column step or used in the immunoreactivity or activity 
assays described below. 

25 

Essentially the same protocol was followed for all 
three milk sources, except that two alternative cation- 
exchange methodologies were employed for colostrum 
purification, described in detail below. Unless 
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otherwise indicated, all chemicals referenced are 
standard, commercially available reagents, readily 
available from a number of sources, including Sigma 
Chemical, Co., St. Louis; Calbiochem, Corp., San Diego 
5 and Aldrich Chemcial Co., Milwaukee. 

step 1. Cation-Exchange Chromatography 

The S-Sepharose purification step was performed as 
10 follows. 200ml of cation exchanger (S-Sepharose, Sigma 
Chemical Corp.) were equilibrated with equilibration 
buffer (6M urea, 20mM MES, 70mM NaCl, pH 6.5). The 
supernatant from the centrifuged extract was diluted to 
final concentration of 6M urea, 20mM MES, 70raM NaCl, pH 
15 6.5. After loading, the column was washed to baseline 
using equilibration buffer, and the bound components 
were eluted stepwise from the column with 6M urea, 20mM 
MES, lOOmM and 500mM NaCl, pH 6.5. The more tightly 
bound components then were eluted with 4M guanidine, 
20 20mM sodium phosphate, pH 7.0. 

The Phenyl-Sepharose purification step was 
performed as follows. 15ml of Phenyl-Sepharose CL-4B 
(Sigma) were equilibrated with 6M urea, 20mM HEPES, 1M 

25 ammonium sulfate, 300mM NaCl, pH 7.0. The 500mM NaCl 
eluate from the S-Sepharose step was diluted with 6M 
urea, 20mM HEPES, 3M ammonium sulfate, 300mM NaCl, pH 
7.0, to a final concentration of 1M ammonium sulfate, 
pH 7.0. After loading, the column was washed to 

30 baseline with equilibration buffer. The column was 

eluted with 6M urea, 20mM HEPES, 0.6M ammonium sulfate, 
300mM NaCl, pH 7.0, and then with 4M guanidine, 20mM 
sodium phosphate, pH 7.0. 
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Two alternative cation-exchange chromatography 
schemes (A and B) were exploited in the purification of 
OP-1 from colostrum, as follows. For both schemes, 200 
ml of S-Sepharose (Sigma) was poured into a 5 X 10 cm 
5 Bio-Rad econocolumn (Bio-Rad, Inc. Cambridge.) 

Scheme A: The colostrum, which had been diluted to 
6M urea, 20mM sodium phosphate, pH 7.0, was loaded onto 
a column equilibrated with 6M urea, 20mM sodium 

10 phosphate, 50mM NaCl, pH 7.0. Elution was stepwise, 
with 6M urea, 20mM sodium phosphate, lOOraM and then 
500mM NaCl, pH 7.0; and the final wash was with 4M 
guanidine, 20mM sodium phosphate, pH 7.0. The Phenyl - 
Sepharose column was run as described above, except 

15 that sodium phosphate was used as the running buffer 
instead of HE PES. The Phenyl-Sepharose bound fraction 
(0.0M ammonium sulfate eluate) from scheme A then was 
dialyzed into 6M urea, 20mM Hepes, 500mM NaCl, pH 7.0, 
before it was applied to the CU++-IMAC column, which 

20 was run as described below. 

Scheme B: The alternative S-Sepharose purification 
was performed as follows. Ethanol-precipitated protein 
was loaded onto an S-Sepharose column equilibrated in 

25 6M urea, 20mM MES, 50mM NaCl, pH 6.5. Elution was 
stepwise with 6M urea, 20mM MES, lOOmM NaCl and then 
500mM NaCl, and finally 4M guanidine, 20mM sodium 
phosphate, pH 7.0. The Phenyl-Sepharose column was run 
as described above, with the 0.0M ammonium sulfate 

30 eluate then applied to a Cu++-IMAC column. 
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step 2. CU++IMAC Chromatography 

The Cu++IMAC purification step was performed as 
follows. 10ml of Pharmacia Fast Flow Chelating Resin 
5 were charged with 0.2M cupric sulfate, and equilibrated 
with 6M urea, 20mM HEPES, 0.5M NaCl, pH 7.0. After 
loading, the column was washed to baseline with 
equilibration buffer. Elution from the column was 
stepwise, using equilibration buffer containing ImM, 
10 5mM, or lOmM imidazole. The column then was stripped 
with equilibration buffer containing lOmM EDTA. The 
lOmM imidazole elution was dialyzed against water/0.1% 
TFA, then against 30% acetonitirile/0. 1% TFA. 

15 step 3. Reverse Phase Chromatography 

The C-18 reverse phase chromatography purification 
step was performed as follows. A HPLC C-18 semi-prep 
column was used for the final purification step. The 

20 gradient used was 30-50% acetonitrile/0 . 1% TFA over 60 
minutes at 3ml/minutes. After the sample was loaded, 
the column was washed to baseline with 30% 
acetonitrile/0. 1% TFA before the gradient is started. 
Fractions collected were 3ml in size. Chromatograms 

25 were read at 214 nm. 

(a) OP-1 from Rat Mammary Gland Extract 

Mammary glands were obtained from 2 female Long 
30 Evan rats (Charles River Labs, Wilmington, MA) one week 
post-partum. The excised glands were mildly 
homogenized in 6M urea, 20mM methylethansulfonate 
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(MES), 0.5M NaCI, pH 6,5 using a polytron homogenizer. 
The suspension then was centrifuged for 20 minutes at 
8/000 RPM, and the supernatant removed for further 
purification. 

5 

Following S-Sepharose chromatography, fractions 
containing 6M urea, 20mM MES containing 500mM NaCI, 
also appeared to contain OP-1 as determined by SDS and 
immunoblot, and were applied to the Phenyl-Sepharose 

10 column. The eluate from the 6M urea, 20mM HEPES, 300raM 
NaCI, pH 7.0 elution step from this column were found 
to contain OP-1. This eluate then was applied to a 
Cu++- IMAC column. Eluate fractions found to contain 
OP-1 were then applied to the C-18 column and 

15 chromatographed as described. 

Figure 1(A) shows the chromatogram and 1(B) the 
corresponding Western blot for fractions from the C-18 
reverse phase chromatography step run under reducing 

20 conditions. Lane S of the Western blot is a standard, 
containing reduced, purified, recombinantly-produced 
OP-1. The arrows show molecular weight markers 
corresponding to 17, 27, and 39 Kd. The reduced monomer 
run at approximately 16-18 Kd; the oxidized homodimer 

25 at approximately 36 Kd. Lanes 13-30 represent the 
corresponding fractions of the C-18 reverse phase 
column as numbered in Fig. 1(A). As can be seen in 
Fig. 1(B), mammary extract OP-1 elutes primarily in 
fractions 21-25 from this final chromatography step. 



30 
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(b) OP-1 from Bovine Colostnim 

Colostrum is the first milk to be produced by the 
mother following birth. Approximately 5 gallons of 
5 bovine colostrum were obtained from a local dairy farm 
and delipidated by centrifugation (8000 rpm for 
approximately 10 min. at 4°C). The supernatant then 
was filtered through cheese cloth. The filtered 
supernatant was stored in 500ml aliquot s at 70 °C. 

10 

50 ml of colostrum were diluted with 100ml of 9M 
urea, 30mM sodium phosphate, pH 7.0. Alternately, 50ml 
of colostrum was added to 50ml of 8M guanidine-HCl, 
50mM Tris, pH 7.2 and precipitated with 40%, then 85% 

15 ice cold ethanol. The pellet was washed with 90% cold 
ethanol and lyophilized overnight. The lyophilized 
pellet was resuspended in 6M urea, 20mM MES, 500mM 
NaCl, pH 6.5, stirred overnight at 4°C, and centrifuged 
at 9,000 RPM for 10 minutes to clarify the suspension 

20 before loading onto the column as described in schemes 
A and B, above. 

Following S-Sepharose chromatography by scheme A, 
both the lOOmM and the 500 mM eluate fractions were 

25 found to contain OP-1, with the lOOmM fraction 

containing relatively more morphogen. This fraction 
then was loaded onto the Phenyl-Sepharose column 
following dilution with an equal volume of 6M urea, 20 
mM sodium phosphate, 2M ammonium sulfate, and 300mM 

30 NaCl. 
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Following S-Sepharose chromatography by scheme B, 
the 500mM NaCl eluate was found to contain OP-1 and was 
loaded onto a Phenyl-Sepharose column as described 
above, following dilution with 6M urea, 40mM HE PES, 2M 
5 ammonium sulfate, pH 7.0. 

Following Cu++IMAC chromatography OP-1 was 
identified in the 5mM and lOmM imidazole eluates for 
both purification schemes, and was dialyzed for further 
10 purification on the C-18 column. 

Both purification schemes produce purified OP-1, as 
determined by immunoblot. Figure 2 shows the. 
chromatogram (A) and corresponding Western blot (B) for 

15 results of purification scheme A (Fig. 2B-1, reduced 
and Fig. 2B-2, oxidized); and Figure 3 shows the 
chromatogram (A) and Western blot (B, reduced) for 
C-18-purified protein from scheme B. As for Fig. IB, 
lane S in Figs. 2B and 3B is a standard, containing 

20 purified, recombinantly produced OP-1; 17, 27 and 39 
are molecular weight markers, and lane numbers 
correspond to fraction numbers in the corresponding 
chromatograms . OP-1 purified by scheme A appears 
predominantly in fractions 18-27, and OP-1 purified by 

25 scheme B appears predominantly in fractions 18-25. 

OP-1 from 57-dav milk 

Milk was obtained from the same cow from which the 
30 colostrum came, 57 days after the birth of the calf. 
The milk was delipidated by centrifugation at 10,000 
RPM for 15 minutes, and the milk was poured off and 
away from the fat layer. 
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100ml of milk then were diluted with 200ml of 9M 
urea, 30raM MES, pH 6.5 and loaded onto a 200ml S- 
Sepharose column which had been equilibrated with 6M 
urea, 20mM MES, 50mM NaCl, pH 6.5. Elution was with 6M 
urea, 20mN MES, lOOmM and 500mM NaCl, and 4M guanidine, 
20mM sodium phosphate, pH 7.0. The 500mM elution was 
put over a Phenyl-Sepharose column after being diluted 
with an equal volume of 6M urea, 20mM MES, 2M ammonium 
sulfate, pH 7.0. 



The Phenyl-Sepharose column then was run as 
described above. The Phenyl-Sepharose-bound sample was 
eluted and applied to a Cu++IMAC column, prepared and 
run as described above. The 10mM imidazole eluate was 
15 found to contain OP-1 and was dialyzed for further 
purification on the C-18 column. 

The C-18 reverse phase chromatography column and 
gradient were performed as described above. The 

20 results are presented in Fig. 4A ( chroma togram) and 4B 
(immunoblot, 10B-1, oxidized; 4B-2, reduced.) As 
above, lane S is a standard, containing purified, 
recombinantly produced OP-1; 17, 27, and 39 are 
molecular weight markers, and the lane numbers 

25 correspond to the fractions numbers in Fig. 4A. OP-1 
purified from 57-day milk appears predominantly in 
fractions 18-26. 



1.2 OP-1 Characterization by immunoreactivity 

OP-1 purified from the different milk sources as 
described above also were characterized by Western 
blotting using antibodies raised against OP-1 and BMP2 . 
Antibodies were prepared using standard immunology 
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protocols well known in the art/ and as described in 
Example 8, below using full-length E. coli -produced OP- 
1 and BMP 2 as the immunogens . 

5 As shown in Fig. 5 OP-1 purified from colostrum 

reacts with the anti-OP-1 antibody, but not with 
anti-BMP 2 antibody. In Fig. 5A and B, lane 1 contains 
reduced, purified, recombinantly-produced OP-1; lane 2 
contains C-18 purified bovine colostrum, and lane 3 

10 contains reduced COP-16, a biosynthetic construct 

having morphogenic activity and an amino acid sequence 
modeled on the proteins described herein, but having 
highest amino acid sequence homology with BMP2 (see US 
Pat. No, 5,011,691 for the COP-16 amino acid sequence.) 

15 In Fig. 5A the gel was probed with anti-OP-1 antibody; 
in Fig. 5B, the gel was probed with anti-BMP2 antibody. 
As can be seen in the figure, anti-OP-1 antibody 
hybridizes with protein in lanes 1 and 2, but not 3; 
while anti-BMP2 antibody hybridizes with lane 3 only. 

20 

C-18 purified mammary gland extract and 57-day milk 
also were shown to react with anti-OP-1 antibodies, 
including antibody raised against the full length E. 
coli OP-1, full length mammalian-produced OP-1, and 
25 the OP-1 Ser-17-Cys peptide (e.g., the OP-1 N-terminal 
17 amino acids) • 

1.3 OP-1 Characterization by Activity 

30 The morphogenic activity of OP-1 purified from mammary 
gland extract was evaluated in vivo as follows. 33% of 
each OP-1 immunoreactive fraction of C-18-purif ied 
mammary gland extract was lyophilized and resuspended 
in 220^1 of 50% acetonitrile/0. 1% TFA. After 
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vortexing, 25 mg of collagen matrix was added. The 
samples were lyophilized overnight, and implanted in 
Long Evans rats (Charles River Laboratories, 
Wilmington, MA, 28-35 days old). Each fraction was 
5 implanted in duplicate. For details of the collagen 
matrix implantation procedure, see, for example, U.S. 
Pat. No. 4,968,590, hereby incorporated by reference. 
After 12 days, the implants were removed and evaluated 
for new bone formation by histological observation. 

10 

The results are presented in Fig. 6A, where "% 
activity" refers to the % of bone formation/total area 
covered by bone in the histology sample. In the 
figure, solid bars represent implants using mammary 

15 extract-derived OP-1, where the fraction numbers 

correspond to the related fractions eluted from the 
C-18 reverse phase column (see Fig. IB), and the 
hatched bar represents implants using recombinantly 
produced OP-1 (600 ng). The results demonstrate that 

20 the peak bone forming activity of C-18-purif ied mammary 
gland extract corresponds with the immunoreactive 
fraction peaks of Fig. IB (compare Fig. 6A and IB.) 

Similarly, the morphogenic activity of OP-1 
25 purified from mammary gland extract was evaluated in 
vitro by measuring alkaline phosphatase activity in 
vitro using the following assay. Test samples were 
prepared using 15-20% of individual immunoreactive 
fractions from the C-18 run which were precipitated and 
30 resuspended in a smaller volume of 50% 

acetonitrile/0.1% TFA. Alkaline phosphatase activity 
was tested using ROS 17/2.8 cells (Rat Osteosarcoma, 
e.g., obtained, for example, from Dr. Robert J. 
Majeska, Mt. Sinai Medical Center, New York, NY, in a 
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standard alkaline phosphatase activity assay (see, for 
example, U.S. Pat. No. 4,968,590). The results, 
presented in Fig. 6B, indicate that the immunoreactive 
fractions obtained from C-18-purif ied mammary gland 
5 extract correspond with alkaline phosphatase activity 
in vitro (compare Fig. 6B and Fig. IB.) In Fig. 6B 
solid bars represent assays performed with mammary 
gland-purified OP-1, where the fraction numbers 
correspond to the related fractions eluted from the 

10 C-18 reverse phase column (see Fig. IB), the hatched 
bar represents the assay performed with purified, 
recombinantly-produced OP-1 (lOOng/ml), and the 
cross-hatched bar represents background. As for 
Fig. 6A, alkaline phosphatase activity corresponds with 

15 immunoreactivity of the C-18-purif ied extract (compare 
Fig. 6B and IB. ) 

Example 2. Morphoqen Identification in Human Serum 

20 OP-1 was detected in human serum using the 

following assay. A monoclonal antibody raised against 
mammalian, recombinantly produced OP-1 using standard 
immunology techniques well described in the art and 
described generally in Example 8, was immobilized by 

25 passing the antibody over an activated agarose gel 

(e.g., Affi-Gel™, from Bio-Rad Laboratories, Richmond, 
CA, prepared following manufacturer's instructions), 
and used to purify OP-1 from serum. Human serum then 
was passed over the column and eluted with 3M 

30 K-thiocyanate . K-thiocyanante fractions then were 

dialyzed in 6M urea, 20mM P0 4 , pH 7.0, applied to a C8 
HPLC column, and eluted with a 20 minute, 25-50% 
acetonitrile/0.1% TFA gradient. Mature, recombinantly 
produced OP-1 homodimers elute between 20-22 minutes. 
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Fractions then were collected and tested for the 
presence of OP-1 by standard immunoblot. Fig. 7 is an 
immunoblot showing OP-1 in human sera under reducing 
and oxidized conditions* In the figure, lanes 1 and 4 
5 are OP-1 standards, run under oxidized (lane 1) and 
reduced (lane 4) conditions. Lane 5 shows molecular 
weight markers at 17, 27 and 39 kDa. Lanes 2 and 3 are 
human sera OP-1, run under oxidized (lane 2) and 
reduced (lane 3) conditions. 

10 

Example 3. Morphoqen-Induced CAM Expression 

The morphogens described herein induce CAM 
expression as part of their induction of morphogenesis. 

15 CAMs are morphoregulatory molecules identified in all 
tissues as an essential step in tissue development. N- 
CAMs, which comprise at least 3 isoforms (N-CAM-180, N- 
CAM-140 and N-CAM-120, where "180", "140" and "120" 
indicate the apparent molecular weights of the isoforms 

20 as measured by polyacrylamide gel electrophoresis) are 
expressed at least transiently in developing tissues, 
and permanently in nerve tissue. Both the N-CAM-180 and 
N-CAM-140 isoforms are expressed in both developing and 
adult tissue. The N-CAM-120 isoform is found only in 

25 adult tissue. Another neural CAM is LI. 

CAMs are implicated in normal tissue development; 
N-CAMs are implicated in appropriate neural 
development, including appropriate neurulation, 
30 neuronal migration, f asciculation, and synaptogenesis. 
Inhibition of N-CAM production, as by complex ing the 
molecule with an N-CAM-specif ic antibody, inhibits 
retina organization, including retinal axon migration, 
and axon regeneration in the peripheral nervous system, 
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as well as axon synapsis with target muscle cells. 
CAMs also have been postulated as part of a 
morphoregulatory pathway whose activity is induced by a 
to date unidentified molecule (See, for example, 
5 Edelman, G.M. (1986) Ann. Rev. Cell Biol. 2:81-116). 
Without being limited to any given theory, the 
morphogens described herein may act as the inducer of 
this pathway. 

10 The morphogens described herein can stimulate CAM 

production. As described below, the morphogens 
stimulate Ll and N-CAM production, including all three 
isoforms of the N-CAM molecule, in nerve tissue. 

15 In this example NG108-15 cells were cultured for 

4 days in the presence of increasing concentrations of 
OP-1 and standard Western blots performed on whole 
cells extracts. The NG10875 cell line is a hybrid cell 
line (neuroblastoma x glioma, American Type Culture 

20 Collection, Rockville, MD). N-CAM isoforms were 

detected with an antibody which crossreacts with all 
three isoforms, mAb H28.123, obtained from Sigma 
Chemical Co., St. Louis, the different isoforms being 
distinguishable by their different mobilities on an 

25 electrophoresis gel. Control NG108-15 cells 

(untreated) express both the 140 kDa and the 180 kDa 
isoforms, but not the 120 kDa, as determined by western 
blot analyses using up to 100 f/g of protein. As shownin 
Pig. 8, treatment of NG108-15 cells with OP-1 resulted 

30 in a dose-dependent increase in the expression of the 
180 kDa and 140 kDa isoforms, as well as the induction 
of the 120 kDa isoform. Fig. 8B is a Western blot of 
OPl-treated NG108-15 cell extracts, probed with mAb 
H28.123, showing the induction of all three isoforms. 
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Fig* 8A is a dose response curve of N-CAM-180 and N- 
CAM-140 induction as a function of morphogen 
concentration. N-CAM-120 is not shown in the graph as 
it could not be quantitated in control cells. However, 
as is clearly evident from the Western blot in Fig. 8A, 
N-CAM-120 is induced in response to morphogen 
treatment. The induction of the 120 isoform also 
indicates that morphogen- induced redif ferentiation of 
transformed cells stimulates not only redif ferentiation 
of these cells from a transformed phenotype, but also 
differentiation to a phenotype corresponding to a 
developed cell. The differential induction of N-CAM 
180 and 140 isoforms seen may be because constitutive 
expression of the 140 isoform is close to maximum. In 
addition, the increase in N-CAM expression corresponded 
in a dose-dependent manner with the morphogen induction 
of multicellular aggregates. 

In addition, the cell aggregation effects of OP-1 
20 on NG 108-15 cells can be inhibited with anti-N-CAM 
antibodies or antisense N-CAM oligonucleotides. 
Antisense oligonucleotides can be made synthetically on 
a nucleotide synthesizer, using standard means known in 
the art. Preferably, phosphor othioate oligonucleotides 
25 ("S-oligos") are prepared, to enhance transport of the 
nucleotides across cell membranes. Concentrations of 
both N-CAM antibodies and N-CAM antisense 
oliognucleotides sufficient to inhibit N-CAM induction 
also inhibited formation of multilayered cell 
30 aggregates. Specifically, incubation of morphogen- 
treated NG108-115 cells with 0.3-3 pM N-CAM antisense 
S-oligos, 5-500 //M unmodified N-CAM antisense oligos, 



10 
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or 10 /jg/ml mAb H28.123 significantly inhibits cell 
aggregation. It is likely that morphogen treatment 
also stimulates other CAMs, as inhibition is not 
complete . 

5 

The experiments also have been performed with 
soluble morphogen (e.g., mature OP-1 associated with 
its pro domain) which also specifically induced CAM 
expression. 

10 

Example 4. Effect of Morphogen Neutra lization 
on Embryogenesis 

As described in Example 7, below, at least one 
15 morphogen, OP2, is found principally in early 

developing embryos (8-day embryos). As described 
below, morphogen neutralization with morphogen-specif ic 
antibodies inhibits embryogenesis. 

20 Morphogen inhibition in developing embryos inhibits 

tissue and organ development. Specifically, 9-day 
mouse embryo cells, cultured in vitro under standard 
culturing conditions, were incubated in the presence 
and absence of an OP-l-specif ic monoclonal antibody 

25 prepared using recombinantly produced, purified mature 
OP-1 as the immunogen. The antibody was prepared using 
standard antibody production means well known in the 
art and essentially as described for Example 9, below. 
After two days, the effect of the antibody on the 

30 developing embryo was evaluated by histology using 
standard histology procedures well known in the art. 
As determined by histological examination, the 
OP-l-specific antibody specifically inhibits eye lobe 
formation in the developing embryo. In particular, the 
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diencephalon outgrowth does not develop. In addition, 
the heart is malformed and enlarged. Moreover , in 
separate immunolocalization studies on embryo sections 
with labelled OP-1 specific antibody, the OP-l-specif ic 
5 antibody localizes to neural epithelia. 

Similarly, morphogen activity may be demonstated in 
fetal development in the mouse model using the 
following assay. Single lip injections comprising 
10 10-100/ig/injection of morphogen-specif ic antibody are 
administered to pregnant female mice during each day of 
the gestation period and tissue development (e.g., bone 
development) in treated and control new mice evaluated 
by standard historaoirphometric analysis at birth. 

15 

Finally, stimulation of endogenous morphogen 
antibody production in egg-laying hens interferes with 
shell formation in the developing eggs. 

20 All of these data demonstate that inhibition of 

morphogen activity significantly interferes with tissue 
development during embryogenesis. 

Example 5. Effect of Morphogen Neutralization on 
25 Juvenile Tissue Development 

The effect of the morphogens described herein on 
tissue development in developing mammals also may be 
demonstrated using neutralizing antibodies specific for 
30 particular morphogens and assessing the effect of these 
antibodies on tissue development as described below. 
Specifically/ anti-morphogen monoclonal and/or 
polyclonal antibodies may be prepared using standard 
methodologies including, for example, the protocol' 
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provided in Example 8, below, and provided to juveniles 
to inhibit the activity of endogenous morphogens. 

Generally, purified antibodies are provided 
5 regularly to new born mice, e.g., 10- 

100/jg/inject ion/day for 10-15 days. At 10 or 21 days, 
the mice are sacrificed and the effect of morphogen on 
bone development assessed by body weight, gross visual 
examination and histology. In this example, anti-OP-1 

10 antibodies were used in 10/jg injections/day for 14 
days, and the mice were sacrificed at 21 days. As is 
dramatically demonstrated in Fig. 9, mice treated with 
OP-1 specific antibody show consistent and significant 
stunted growth, including reduced body length and body 

15 weight, (9B) as compared with untreated mice (9A). 

Histological examination showed reduced bone growth as 
evidenced by reduced bone size in the treated mice. 

In a variation on this protocol, single lip 
20 injections also may be provided to older juveniles and 
adult mice (e.g., 10-100 /ig) over a prolonged time 
(e.g., 10-15 days) to evaluate the effect or morphogen 
neutralization on bone growth and bone integrity and to 
evaluate the onset of osteoporosis. 

25 

Example 6. Morphogen Treatment of Osteoporsis 

6.1 Effect of Morphogen on Trabecular Bone in 
Ovariectomized (OVX) Rats 

30 

Aged individuals, and particularly postmenopausal 
women are particularly at risk for osteoporosis. 
Provided below is an animal osteoposis model 
demonstrating the ability of morphogens to subtantially 
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inhibit and/or reduce the tissue damage effects 
associated with osteoporosis, wherein osteoporosis is 
induced by ovary removal in rats. Bone growth is 
evaluated in these animals by measuring serum alkaline 
5 phosphataseans osteocalcin levels in treated an 
untreated rats. 

Forty Long-Evans rats (Charles River Laboratories, 
Wilmington) weighing about 200g each are ovariectomized 

10 (OVX) using standard surgical procedures, and ten rats 
are sham-operated. The ovariectomization of the rats 
produces an osteoporotic condition within the rats as a 
result of decreased estrogen production. Food and 
water are provided ad libitum . Eight days after 

15 ovariectomy, the rats, prepared as described above, 
were divided into five groups: (A), 10 sham-operated 
rats; (B), 10 ovariectomized rats receiving 1 ml of 
phosphate-buffered saline (PBS) i.v. in the tail vein; 
(C) 10 ovariectomized rats receiving about 1 mg of 

20 17PE 2 ( 17-p-estradiol E 2 ) by intravenous injection 
through the tail vein; (D) 9 ovariectomized rats 
receiving daily injections of approximately 2fjg of 
morphogen by tail vein for 22 days; and (E) 9 
ovariectomized rats receiving daily injections of 

25 approximately 20 fjg of morphogen by tail vein for 
22 days. In this example, OP-1 was the morphogen 
tested. 



30 



On the 15th and 21st day of the study, each rat was 
injected with 5 mg of tetracycline, and on day 22, the 
rats were sacrificed. The body weights, uterine 



a 
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weights, serum alkaline phosphate levels, serum calcium 
levels and serum osteocalcin levels then were 
determined for each rat. The results are shown in 
Tables III and IV. 
5 Table III 

Body Weights. Uterine Weights and Alkaline Phosphatase 



10 



15 



Group 

A-SHAK 
B-OVX+PBS 
C-0VX+E2 
D-0VX+0P-1 

(2yg) 
E-0VX+0P-1 
(20 ug) 



Body Weights 
(g) 

250.90 + 17.04 
273.40 + 16.81 

241.66 + 21.54 

266.67 + 10.43 



Uterine Weights 
(g) 

0.4192 + 0.10 
0.1650 + 0.04 
0.3081 + 0.03 
0.1416 + 0.03 



272.40 + 20.48 0.1481 + 0.05 



Alk. Phosphatase 
(U/L) 

43.25 + 6.11 
56.22 + 6.21 
62.66 + 4.11 
58.09 + 12.97 

66.24 + 15.74 



20 



TABLE IV 



Serum Calcium and Serun Osteocalcin Levels 



25 



30 



Group 



A-SHAM 
B-OVX+PBS 
C-0VX+E2 
D-OVX+OP-1 

(2ug) 
E-0VX+0P-1 

(20pg) 



Serum Calcium 
(ng/dl) 

8.82 + 1.65 
8.95 + 1.25 
9.20 + 1.39 
8.77 + 0^95 

8.67 + 1.94 



Serum Osteocalcin 
(ng/ml) 

64.66 + 14.77 
69.01 + 10.20 
67.13 + 17.33 
148.50 + 84.11 

182.42 + 52.11 
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The results presented in Table III and IV show that 
intravenous injection of morphogen into ovariectomized 
rats produces a significant increase in serum alkaline 
phosphatase and serum osteocalcin levels and 
5 demonstrates that systemic administration of the 
morphogen stimulates bone formation in osteoporotic 
bone. 

6.2 Histomorphometric Analysis of Morp hogen on the 
10 Tibia Diaphysis in Ovariectomized f OVX) Rats 

Fifteen female Long-Evans rats weighing about 160 g 
were ovariectomized (OVX) to produce an osteoporotic 
condition and five rats were sham operated (Charles 

15 River Laboratories , Wilmington, MA.) as described for 
Example 8. Food and water were provided ad libitum. 
Twenty-two days after ovariectomy, the rats were 
divided into four groups: (A) sham-operated (1 ml of 
PBS by intravenous injection through tail vein (5 

20 rats); (B) OVX, into which nothing was injected (5 
rats); (C) OVX, receiving about 1 mg of 170E 2 by 
intravenous injection through the tail vein 
(5 rats), and (D) OVX, receiving about 1 /jg of 
morphogen by intravenous injection through the tail 

25 vein (5 rats). In this example, OP-1 was morphogen 
tested. 

The rats were injected daily as described for seven 
days, except no injections were given on the thirteenth 
30 day. The rats then were sacrificed on the nineteenth 
day. The tibial diaphyseal long bones then were 
removed and fixed in ethanol and histomorphometric 
analysis was carried out using standard procedures well 
known in the art. The results are shown in Table V. 



35 



WO 94/03075 



PCT/US93/07190 



- 87 - 
Table V 

(A) (B) (C) (D) 

MEASUREMENT CONTROL OVX OVX + E 2 OVX + 0P-1 

5 Longitudinal Growth 20*2 + 0.3 19-4 + 0,2 4-9 + 0.5 17.9 + 0.9 

Rate (lim/day) 

Cancellous Bone 20.2 + 1.5 13.0 + 1.6 13.7 +2.1 16.6 +JL.8 

Volume (BV/TV, 
10 bone vol/total vol) 

Cancellous Bone 16.2 + 1.8 9.6 + 0.9 11.5 +1.1 12.2 +0.7 

Perimeter (mm) 

15 Labeled Cancellous 35.5 + 1.5 51.9 + 5.6 58.0 + 4.2 39.2 + 1.9 

Perimeter (X) 

Mineral Apposition 1.76 + 0.14 2.25 + 0.16 1.87 + 0.08 1.86 + 0.20 
Rate (yra/day) 



20 



The results presented in Table V confirm the 
results of Example 6.1, namely that intravenous 
injection of OP-1 into ovariectomized rats stimulates 
bone growth for bone which had been lost due to the 

25 drop in estrogen within the individual rat. 

Specifically/ the inhibition of cancellous bone volume 
in OVX rats is repaired by the systemically provided 
morphogen. In addition, in morphogen-treated rats the 
labelled cancellous perimeter and mineral apposition 

30 rate now return to levels measured in the control, 

sham-operated rats. Moreover, morphogen treatment does 
not inhibit longitudinal bone growth, unlike estrogen 
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treatment, which appears to inhibit bone growth 
significantly. Accordingly, systemic administration of 
a morphogen in therapeutically effective concentations 
effectively inhibits loss of bone mass in a mammal 
5 without inhibiting natural bone formation. 

Example 7. Identification of Morphogen-Expressing 
Tissue 

10 Determining the tissue distribution of morphogens 

may be used to identify different morphogens expressed 
in a given tissue, as well as to identify new, related 
morphogens. Tissue distribution also may be used to 
identify useful morphogen-producing tissue for use in 

15 screening and identifying candidate morphogen- 

stimulating agents. The morphogens (or their mRNA 
transcripts) readily are identified in different 
tissues using standard methodologies and minor 
modifications thereof in tissues where expression may 

20 be low. For example, protein distribution may be 
determined using standard Western blot analysis or 
immunofluorescent techniques, and antibodies specific 
to the morphogen or morphogens of interest. Similarly, 
the distribution of morphogen transcripts may be 

25 determined using standard Northern hybridization 
protocols and transcript-specific probes. 

Any probe capable of hybridizing specifically to a 
transcript, and distinguishing the transcript of 
30 interest from other, related transcripts may be used. 
Because the morphogens described herein share such high 
sequence homology in their active, C-terminal domains, 
the tissue distribution of a specific morphogen 
transcript may best be determined using a probe 
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specific for the pro region of the immature protein 
and/or the N-terminal region of the mature protein. 
Another useful sequence is the 3' non-coding region 
flanking and immediately following the stop codon. 
5 These portions of the sequence vary substantially among 
the morphogens of this invention, and accordingly, are 
specific for each protein. For example, a particularly 
useful Vgr-l-specif ic probe sequence is the Pvull-Sacl 
fragment, a 265 bp fragment encoding both a portion of 

10 the untranslated pro region and the N-terminus of the 
mature sequence (see Lyons et al. (1989) PNA S 86:4554- 
4558 for a description of the cDNA sequence). 
Similarly, particularly useful mOP-l-specif ic probe 
sequences are the BstXl-Bgll fragment, a 0.68 Kb 

15 sequence that covers approximately two- thirds of the 
mOP-1 pro region? a StuI-StuI fragment, a 0.2 Kb 
sequence immediately upstream of the 7-cysteine domain? 
and the Earl-Pstl fragment, an 0.3 Kb fragment 
containing a portion of the 3 'untranslated sequence 

20 (See Seq. ID No. 18, where the pro region is defined 
essentially by residues 30-291.) Similar approaches 
may be used, for example, with hOP-1 (Seq. ID No. 16) 
or human or mouse OP-2 (Seq. ID Nos. 20 and 22.) 

25 Using these morphogen-specif ic probes, which may be 

synthetically engineered or obtained from cloned 
sequences, morphogen transcripts can be identified in 
mammalian tissue, using standard methodologies well 
known to those having ordinary skill in the art. 

30 Briefly, total RNA is prepared from various adult 
murine tissues (e.g., liver, kidney, testis, heart, 
brain, thymus and stomach) by a standard methodology 
such as by the method of Chomczyaski et al. ((1987) 
Anal. Biochem 162:156-159) and described below. Poly 
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(A)+ RNA is prepared by using oligo (dT) -cellulose 
chromatography (e.g., Type 7, from Pharmacia LKB 
Biotechnology, Inc.). Poly (A) + RNA (generally 15 /;g) 
from each tissue is fractionated on a 1% 
5 agarose/formaldehyde gel and transferred onto a Nytran 
membrane ( Schleicher & Schuell ) . Following the 
transfer, the membrane is baked at 80 °C and the RNA is 
cross-linked under UV light (generally 30 seconds at 1 
mW/cm 2 ). Prior to hybridization, the appropriate probe 

10 is denatured by heating. The hybridization is carried 
out in a lucite cylinder rotating in a roller bottle 
apparatus at approximately 1 rev/min for approximately 
•15 hours at 37 °C using a hybridization mix of 40% 
formamide, 5 x Denhardts, 5 x SSPE, and 0.1% SDS. 

15 Following hybridization, the non-specific counts are 
washed off the filters in 0.1 x SSPE, 0.1% SDS at 50°C. 

Examples demonstrating the tissue distribution of 
various morphogens, including Vgr-1, OP-1, BMP2, BMP 3 , 

20 BMP4, BMP5, GDF-1, and OP-2 in developing and adult 
tissue are disclosed in international application 
US92/01968 (W092/15323), and in Ozkaynak, et al., 
(1991) Biochem. Biophys. Res. Commn . 179:116-123, and 
Ozkaynak, et al. (1992) f J. Biol. Chem. 267 ; 25220- 

25 25227), the disclosures of which are incorporated 
herein by reference. Using the general probing 
methodology described herein, northern blot 
hybridizations using probes specific for these 
morphogens to probe brain, spleen, lung, heart, liver 

30 and kidney tissue indicate that kidney-related tissue 
appears to be the primary expression source for OP-1, 
with brain, heart and lung tissues being secondary 
sources. Lung tissue appears to be the primary tissue 
expression source for Vgr-1, BMP 5, BMP 4 and BMP3. Lower 
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levels of Vgr-1 also are seen in kidney and heart 
tissue, while the liver appears to be a secondary 
expression source for BMP5, and the spleen appears to 
be a secondary expression source for BMP4. GDF-1 
5 appears to be expressed primarily in brain tissue. 

Of particular relevance to the present application , 
OP-1 also is detected in adult rat stomach and gut 
tissue. Moreover, OP-2 appears to be expressed 
10 primarily in early embryonic tissue. Specifically, 
northern blots of murine embryos and 6-day post-natal 
animals shows abundant OP2 expression in 8-day embryos. 
Expression is reduced significantly in 17-day. embryos 
and is not detected in post-natal animals. 

15 

in addition, labelled soluble OP-1 (iodinated with 
12 5 1, using standard labelling procedures well known in 
the art) and injected into the rat tail vein also is 
localized to the stomach tissue within 30 minutes of 
20 injection. 

Example 8. • Detecting Morphoqenic Prot ein in Solution 
bv Immunoassay 

25 Morphogens are readily detected in solution with a 

standard immunoassay, using a polyclonal or monoclonal 
antibody specific for that protein and standard Western 
blot, ELISA (enzyme-linked immunoabsorbant assay) or 
other immunoassay technique well known in the art. A 

30 currently preferred, exemplary protocol for an ELISA 
assay, as well as means for generating morphogen- 
specific antibody are presented below. Standard 
protocols for antibody production, Western blot and 
other immunoassays also are described, for example, in 
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Molecular Cloning A Laboratory Manual , Sarabrook et al., 
eds. 1989, Cold Spring Harbor Press, Cold Spring 
Harbor, NY. Standard ELISA technique is described, for 
example, by Engvall (1980) Methods Enzymol . 70:419-439. 

5 

8.1 Morphoqen-Specific Antiserum 

Polyclonal antibody was prepared as follows. Each 
rabbit was given a primary immunization of 100 ug/500 

10 /til E. coli -produced OP-1 monomer (amino acids 328-431 
in SEQ ID NO: 5) in 0.1% SDS mixed with 500 yl 

Complete Freund's Adjuvant. The antigen was injected 
subcutaneous ly at multiple sites on the back and flanks 
of the animal. The rabbit was boosted after a month in 

15 the same manner using incomplete Freund's Adjuvant. 
Test bleeds are taken from the ear vein seven days 
later. Two additional boosts and test bleeds were 
performed at monthly intervals until antibody against 
OP-1 was detected in the serum using an ELISA assay. 

20 Then, the rabbit was boosted monthly with 100 j/g of 
antigen and bled (15 ml per bleed) at days seven and 
ten after boosting. 

8.2 Morphogen-Specif ic Antibody 

25 

Monoclonal antibody specific for a given raorphogen 
was prepared as follows. A mouse was given two 
injections of E. coli produced OP-1 monomer. The first 
injection contains 100/jg of OP-1 in complete Freund's 
30 adjuvant and was given subcutaneous ly. The second 
injection contained 50 fjg of OP-1 in incomplete 
adjuvant and was given intraperitoneally. The mouse 
then received a total of 230 pg of OP-1 (amino acids 
307-431 in SEQ ID NO: 5) in four intraperitoneal 
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injections at various times over an eight month period. 
One week prior to fusion, both mice were boosted 
intraperitoneal^ with 100 fjg of OP-1 (307-431) and 30 
M of the N-terminal peptide (Ser293-Asn309-Cys) 
5 conjugated through the added cysteine to bovine serum 
albumin with SMCC crosslinking agent. This boost was 
repeated five days (IP), four days (IP), three days 
(IP) and one day (IV) prior to fusion. The mouse 
spleen cells then were fused to commercially available 

10 myeloma cells at a ratio of 1:1 using PEG 1500 

(Boeringer Mannheim, Germany), and the cell fusion 
plated and screened for OP-l-specif ic antibodies using 
OP-1 (307-431) as antigen. The cell fusion and 
monoclonal screening then were according to standard 

15 procedures well described in standard texts widely 
available in the art e.g., Maniatis et al. Molecular 
Cloning A Laboratory Manual / Cold Spring Harbor Press. 

8.3 Morphoqen ELISA 

20 

1 fjg/100 of affinity-purified polyclonal rabbit 
IgG specific for OP-1 was added to each well of a 
96-well plate and incubated at 37 °C for an hour. The 
wells were washed four times with 0.167M sodium borate 

25 buffer with 0.15 M NaCl (BSB), pH 8.2, containing 0.1% 
Tween 20. To minimize non-specific binding, the wells 
are blocked by filling completely with 1% bovine serum 
albumin (BSA) in BSB and incubating for 1 hour at 37 °C. 
The wells are then washed four times with BSB 

30 containing 0.1% Tween 20. A 100 fjl aliquot of an 
appropriate dilution of each of the test samples of 
cell culture supernatant was added to each well in 
triplicate and incubated at 37 °C for 30 min. After 
incubation, 100 p/1 biotinylated rabbit anti-OP-1 serum 
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(stock solution is about 1 mg/ml and diluted 1:400 in 
BSB containing 1% BSA before use) are added to each 
well and incubated at 37°C for 30 min. The wells were 
then washed four times with BSB containing 0.1% Tween 
5 20. 100 pi strepavidin-alkaline (Southern 

Biotechnology Associates/ Inc. Birmingham/ Alabama, 
diluted 1:2000 in BSB containing 0.1% Tween 20 before 
use) was added to each well and incubated at 37 °C for 
30 min. The plates were washed four times with 0.5M 

10 Tris buffered Saline ( TBS ) , pH 7.2. 50pl substrate 
(ELISA Amplification System Kit/ Life Technologies, 
Inc./ Bethesda, MD) was added to each well incubated at 
room temperature for 15 min. Then, 50 pi amplifier 
(from the same amplification system kit) is added and 

15 incubated for another 15 min at room temperature. The 
reaction was stopped by the addition of 50 pi 0.3 M 
sulphuric acid. The OD at 490 nm of the solution in 
each well was recorded. To quant itate OP-1 in culture 
media, an OP-1 standard curve was performed in parallel 

20 with the test samples. 

The invention may be embodied in other specific 
forms without departing from the spirit or essential 
characteristics thereof. The present embodiments are 

25 therefore to be considered in all respects as 

illustrative and not restrictive, the scope of the 
invention being indicated by the appended claims rather 
than by the foregoing description, and all changes 
which come within the meaning and range of equivalency 

30 of the claims are therefore intended to be embraced 
therein. 
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SEQUENCE LISTING 



5 (1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: CREATIVE BIOMOLECULES , INC. 

(B) STREET: 35 SOUTH STREET 

(C) CITY: HOPKTNTON 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 01748 

(G) TELEPHONE: 1-508-435-9001 

(H) TELEFAX: 1-508-435-0454 

(I) TELEX: 

(ii) TITLE OF INVENTION: MORPHOGEN-ENRICHED DIETARY COMPOSITION 
20 (iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CREATIVE BIOMOLECULES , INC. 

(B) STREET: 35 SOUTH STREET 
25 (C) CITY: HOPKTNTON 

(D) STATE: HA 

(E) COUNTRY: USA 

(F) ZIP: 01748 

COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release 01.0, Version #1.25 

ATTORNEY/AGENT INFORMATION: 

(A) NAME: KELLEY, ROBIN D. 

(B) REGISTRATION NUMBER: 34,637 

(C) REFERENCE/DOCKET NUMBER: CRP-071 

TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617/248-7000 

(B) TELEFAX: 617/248-7100 

45 (2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



15 



30 (v) 



35 

(viii) 



40 
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(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 
5 (A) NAME/KEY: Protein 

(B) LOCATION: 1..97 

(0) OTHER INFORMATION: /label- GENERIC-SEQ1 

/note= "WHEREIN EACH XAA INDEPENDENTLY INDICATES 
ONE OF THE 20 NATURALLY-OCCURING L-ISOMER, A-AHINO 
10 ACIDS, OR A DERIVATIVE THEREOF." 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

15 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

15 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Cys Xaa Xaa Xaa 
20 25 30 

20 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Cys Xaa Xaa 
25 50 55 60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

30 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Cys 

85 90 95 

Xaa 

35 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 97 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 
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(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..97 

(D) OTHER INFORMATION: /labels GENERIC-SEQ2 
5 /note= "WHEREIN EACH XAA INDEPENDENTLY INDICATES 

ONE OF THE 20 NATURALLY OCCURING L- ISOMER A-AMINO 
ACIDS, OR A DERIVATIVE THEREOF." 

10 (xl) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 

15 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Cys Xaa Xaa Xaa 

20 25 30 



20 



Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Cys Xaa Xaa 
50 55 60 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
25 65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Cys 
85 90 95 

30 Xaa 

(2) INFORMATION FOR SEQ ID NO: 3: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: protein 
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(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..97 

(D) OTHER INFORMATION: /label* GENERIC-SEQ3 
5 /note= "WHEREIN EACH XAA IS INDEPENDENTLY SELECTED 

FROM A GROUP OF ONE OR MORE SPECIFIED AMINO ACIDS 
AS DEFINED IN THE SPECIFICATION. " 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Leu Tyr Val Xaa Phe Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa Xaa Ala 
15 10 15 

15 Pro Xaa Gly Xaa Xaa Ala Xaa Tyr Cys Xaa Gly Xaa Cys Xaa Xaa Pro 

20 25 30 



20 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn His Ala Xaa Xaa Xaa Xaa Leu 

35 40 « 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Cys Xaa Pro 
50 55 60 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
25 65 70 75 80 

Val Xaa Leu Xaa Xaa Xaa Xaa Xaa Met Xaa Val Xaa Xaa Cys Gly Cys 
85 90 95 

30 Xaa 

(2) INFORMATION FOR SEQ ID NO: 4: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: protein 
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(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /label= GENERIC-SEQ4 
5 /note= "WHEREIN EACH XAA IS INDEPENDENTLY SELECTED 

FROM A GROUP OF ONE OR HORE SPECIFIED AMINO ACIDS 
AS DEFINED IN THE SPECIFICATION." 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Cys Xaa Xaa Xaa Zaa Leu Tyr Val Xaa Phe Xaa Xaa Xaa Gly Trp Xaa 
1 5 10 15 

15 Xaa Trp Xaa Xaa Ala Pro Xaa Gly Xaa Xaa Ala Xaa Tyr Cys Xaa Gly 

20 25 30 

Xaa Cys Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn His Ala 
35 40 45 

Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
50 55 60 

Xaa Cys Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa 
25 65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Val Xaa Leu Xaa Xaa Xaa Xaa Xaa Met Xaa Val 
85 90 95 

30 Xaa Xaa Cys Gly Cys Xaa 

100 

(2) INFORMATION FOR SEQ ID NO: 5: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



40 



(ii) MOLECULE TYPE: protein 



(vi) ORIGINAL SOURCE: 

(A) ORGANISH: Homo sapiens 
45 (F) TISSUE TYPE: HIPPOCAMPUS 
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(ix) FEATURE: 

(A) NAME /KEY: Protein 

(B) LOCATION: 1. .139 

(D) OTHER INFORMATION: /label* hOPl-MATURE 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro Lys 
10 1 5 10 15 

Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser Ser 
20 25 30 

15 Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg 

35 AO 45 



20 



35 



Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala 
50 55 60 

Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn 
65 70 75 80 



Ala Thr Asn His Ala He Val Gin Thr Leu Val His Phe He Asn Pro 
25 85 90 95 

Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He 
100 105 HO 

30 Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr 

115 120 125 



Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
130 135 

(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 
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(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..139 

(D) OTHER INFORMATION: /label* MOP 1 -MATURE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser Thr Gly Gly Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro Lys 
15 10 15 

Asn Gin Glu Ala Leu Arg Met Ala Ser Val Ala Glu Asn Ser Ser Ser 
20 25 30 

15 Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg 

35 40 A5 

Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala 
50 55 60 

20 

Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn 
65 70 75 80 

Ala Thr Asn His Ala He Val Gin Thr Leu Val His Phe He Asn Pro 
25 85 90 95 

Asp Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He 
100 105 HO 

30 Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr 

115 120 125 



35 



Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
130 135 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 139 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 
(F) TISSUE TYPE: HIPPOCAMPUS 
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(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1. .139 

(D) OTHER INFORMATION: /label* HOP 2 -MATURE 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ala Val Are Pro Leu Arg Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu 
10 1 5 10 15 

Pro Gin Ala Asn Arg Leu Pro Gly He Phe Asp Asp Val His Gly Ser 
20 25 30 

15 His Gly Arg Gin Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gin 

35 40 45 

Asp Leu Gly Trp Leu Asp Trp Val He Ala Pro Gin Gly Tyr Ser Ala 
50 55 60 

Tyr Tyr Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn 
65 70 75 80 

Ala Thr Asn His Ala He Leu Gin Ser Leu Val His Leu Met Lys Pro 
25 85 90 95 

Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 
100 105 HO 

30 Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg Lys His 

115 120 125 



20 



35 



Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
130 135 

-(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



45 



(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 
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(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..139 

(D) OTHER INFORMATION: /label- MOP 2 -MATURE 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ala Ala Are Pro Leu Lys Arg Are Gin Pro Lys Lys Thr Asn Glu Leu 
10 1 5 10 15 

Pro His Pro Asn Lys Leu Pro Gly lie Phe Asp Asp Gly His Gly Ser 
20 25 30 

15 Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Arg 

35 40 45 



20 



35 



Asp Leu Gly Trp Leu Asp Trp Val He Ala Pro Gin Gly Tyr Ser Ala 
50 55 60 

Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asp Ser Cys Met Asn 
65 70 75 80 



Ala Thr Asn His Ala He Leu Gin Ser Leu Val His Leu Met Lys Pro 
25 85 90 95 

Asp Val Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 
100 105 110 

30 Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg Lys His 

115 120 125 



Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
130 135 

(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bovinae 
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(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..101 

(D) OTHER INFORMATION: /label- CBHP-2A-FX 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 

Cvs Lys Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 
10 1 5 10 15 

Asp Trp lie Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly 
20 25 30 

15 Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 

35 40 45 

He Val Gin Thr Leu Val Asn Ser Val Asn Ser Lys He Pro Lys Ala 
50 55 60 

Cys Cys Val Pro Thr Glu Leu Ser Ala He Ser Met Leu Tyr Leu Asp 
65 70 75 80 

Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gin Asp Met Val Val Glu 
85 90 95 

Gly Cys Gly Cys Arg 
100 



20 



25 



30 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 
(F) TISSUE TYPE: hippocampus 

(ix) FEATURE: 
45 (A) NAME/KEY: Protein 

(B) LOCATION: 1..101 

(D) OTHER INFORMATION: /label= CBHP-2B-FX 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Arg Arc His Ser Leu Tyr Val Asp Pbe Ser Asp Val Gly Trp Asn 
15 10 15 

5 

Asp Trp lie Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly 
20 25 30 

Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 
10 35 40 45 

He Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser He Pro Lys Ala 
50 55 60 

15 Cys Cys Val Pro Thr Glu Leu Ser Ala He Ser Het Leu Tyr Leu Asp 

65 70 75 80 



20 



25 



Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Het Val Val Glu 
85 90 95 

Gly Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE i amino acid 

(C) STRAND EDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: DROSOPHILA HELANOGASTER 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..101 

40 (D) OTHER INFORMATION: /label= DPP-FX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

45 Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asp 

15 10 15 

Asp Trp He Val Ala Pro Leu Gly Tyr Asp Ala Tyr Tyr Cys His Gly 
20 25 30 

50 
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Lys Cys Pro Phe Pro Leu Ala Asp His Phe Asn Ser Thr Asn His Ala 
35 40 45 

Val Val Gin Thr Leu Val Asn Asn Asn Asn Pro Gly Lys Val Pro Lys 
5 50 55 60 

Ala Cys Cys Val Pro Thr Gin Leu Asp Ser Val Ala Met Leu Tyr Leu 
65 70 75 80 

10 Asn Asp Gin Ser Thr Val Val Leu Lys Asn Tyr Gin Glu Met Thr Val 

85 90 95 



15 



Val Gly Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 
20 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: XENOPUS 



(ix) FEATURE: 
30 (A) NAME/KEY: Protein 

(B) LOCATION: 1..102 
(D) OTHER INFORMATION: /label= VGL-FX 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Lys Lys Arg His Leu Tyr Val Glu Phe Lys Asp Val Gly Trp Gin 
15 10 15 

40 Asn Trp Val He Ala Pro Gin Gly Tyr Met Ala Asn Tyr Cys Tyr Gly 

20 25 30 



45 



Glu Cys Pro Tyr Pro Leu Thr Glu He Leu Asn Gly Ser Asn His Ala 
35 40 45 

He Leu Gin Thr Leu Val His Ser He Glu Pro Glu Asp He Pro Leu 
50 55 60 



Pro Cys Cys Val Pro Thr Lys Met Ser Pro He Ser Met Leu Phe Tyr 
50 65 70 75 80 
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Asp Asn Asn Asp Asn Val Val Leu Arg His Tyr Glu Asn Met Ala Val 
y 85 90 95 

Asp Glu Cys Gly Cys Arg 
5 100 

(2) INFORMATION FOR SEQ ID NO: 13: 

.(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HURIOAE 

20 (ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /label- VGR-l-FX 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Gin Asp Val Gly Trp Gin 
15 10 15 

Asp Trp lie He Ala Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 35 AO 45 

He Val Gin Thr Leu Val His Val Met Asn Pro Glu Tyr Val Pro Lys 
50 55 60 

40 Pro Cys Cys Ala Pro Thr Lys Val Asn Ala He Ser Val Leu Tyr Phe 

65 70 75 80 



30 



45 



Asp Asp Asn Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ala Cys Gly Cys His 
100 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: brain 

(ix) FEATURE: 
20 (A) NAME/KEY: Protein 

(B) LOCATION: 1--106 

(D) OTHER INFORMATION: /note* "GDF-l (fx)" 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly Trp His 
15 10 15 

30 Arg Trp Val He Ala Pro Arg Gly Phe Leu Ala Asn Tyr Cys Gin Gly 

20 25 30 



35 



Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly Gly Pro Pro Ala 
35 40 45 

Leu Asn His Ala Val Leu Arg Ala Leu Met His Ala Ala Ala Pro Gly 
50 55 60 



Ala Ala Asp Leu Pro Cys Cys Val Pro Ala Arg Leu Ser Pro He Ser 
40 65 70 75 80 

Val Leu Phe Phe Asp Asn Ser Asp Asn Val Val Leu Arg Gin Tyr Glu 
85 90 95 

45 Asp Met Val Val Asp Glu Cys Gly Cys Arg 

100 105 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

15 Cys Xaa Xaa Xaa Xaa 

1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

30 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 
(F) TISSUE TYPE: HIPPOCAMPUS 

35 

_/ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 49.. 1341 

(C) IDENTIFICATION METHOD: experimental 

40 (D) OTHER INFORMATION: /functions "OSTEOGENIC PROTEIN" 

/products "OP1" 
/evidence^ EXPERIMENTAL 
/standard_name= "0P1" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
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GGTGCGGGCC CGGAGCCCGG AGCCCGGGTA GCGCGTAGAG CCGGCGCG ATG CAC GTG 57 

Met His Val 
1 

5 CGC TCA CTG CGA GCT GCG GCG CCG CAC AGC TIC GTG GCG CTC TGG GCA 105 
Are Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala 
5 10 15 

CCC CTG TTC CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC 153 
10 Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn 
20 25 30 35 

GAG GTG CAC TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG 201 
Glu Val His Ser Ser Phe He His Arg Arg Leu Arg Ser Gin Glu Arg 
15 40 45 50 

CGG GAG ATG CAG CGC GAG ATC CTC TCC ATT TTG GGC TTG CCC CAC CGC 249 
Arg Glu Met Gin Arg Glu He Leu Ser He Leu Gly Leu Pro His Arg 
55 60 65 

20 

CCG CGC CCG CAC CTC CAG GGC .AAG CAC AAC TCG GCA CCC ATG TTC ATG 297 
Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro Met Phe Met 
70 75 80 

25 CTG GAC CTG TAC AAC GCC ATG GCG GTG GAG GAG GGC GGC GGG CCC GGC 345 
Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly Gly Pro Gly 
85 90 95 

GGC CAG GGC TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC 393 
30 Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gin Gly 
100 105 HO 115 

CCC CCT CTG GCC AGC CTG CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC 441 
Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp Ala Asp 
35 120 125 130 

ATG GTC ATG AGC TTC GTC AAC CTC GTG GAA CAT GAC AAG GAA TTC TTC 489 
Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Phe Phe 
135 140 145 



40 



CAC CCA CGC TAC CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC 537 
His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys He 
150 155 160 



45 CCA GAA GGG GAA GCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC 585 
Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp 
165 170 175 
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TAC ATC CGG GAA CGC TTC GAC AAT GAG ACG TTC CGG ATC AGC GTT TAT 
Tyr He Arg Glu Are Phe Asp Asn Glu Tbr The Arg He Ser Val Tyr 
180 185 190 195 

5 CAG GTG CTC CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC 
Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu 
200 205 210 

GAC AGC CGT ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT GAC 
10 Asp Ser Arg Thr Leu Tip Ala Ser Glu Glu Gly Trp Leu Val Phe Asp 
215 220 225 

ATC ACA GCC ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC AAC CTG 
He Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu 
230 235 2A0 

GGC CTG CAG CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC ATC AAC CCC 
Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser He Asn Pro 
245 250 255 

AAG TTG GCG GGC CTG ATT GGG CGG CAC GGG CCC CAG AAC AAG CAG CCC 
Lys Leu Ala Gly Leu He Gly Arg His Gly Pro Gin Asn Lys Gin Pro 
260 265 270 275 

TTC ATG GTG GCT TTC TTC AAG GCC ACG GAG GTC CAC TTC CGC AGC ATC 
Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe Arg Ser He 
280 285 290 



15 



20 



25 



CGG TCC ACG GGG AGC AAA CAG CGC AGC CAG AAC CGC TCC AAG ACG CCC 
30 Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro 
295 300 305 

AAG AAC CAG GAA GCC CTG CGG ATG GCC AAC GTG GCA GAG AAC AGC AGC 
Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser 
35 310 315 320 

AGC GAC CAG AGG CAG GCC TGT AAG AAG CAC GAG CTG TAT GTC AGC TTC 

Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe 
325 330 335 

40 

CGA GAC CTG GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC 

Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly Tyr Ala 

340 345 350 355 

45 GCC TAC TAC TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG 
Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met 
360 365 370 



633 



681 



729 



777 



825 



873 



921 



969 



1017 



1065 



1113 



1161 
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AAC GCC ACC AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC 1209 
Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His Phe He Asn 
375 380 385 

5 CCG GAA ACG GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC 1257 
Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala 
390 395 400 

ATC TCC GTC CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA 1305 
10 He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys 
405 410 415 

TAC AGA AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC 1351 
Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
15 420 425 430 

GAGAATTCAG ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGCTCG CCTTGGCCAG 1411 

GAACCAGCAG ACCAACTGCC TTTTGTGAGA CCTTCCCCTC CCTATCCCCA ACTTTAAAGG 1471 

20 

TGTGAGAGTA TTAGGAAACA TGAGCAGCAT ATGGCTTTTG ATCAGTTTTT CAGTGGCAGC 1531 

ATCCAATGAA CAAGATCCTA CAAGCTGTGC AGGCAAAACC TAGCAGGAAA AAAAAACAAC 1591 

25 GCATAAAGAA AAATGGCCGG GCCAGGTCAT TGGCTGGGAA GTCTCAGCCA TGCACGGACT 1651 

CGTTTCCAGA GGTAATTATG AGCGCCTACC AGCCAGGCCA CCCAGCCGTG GGAGGAAGGG 1711 

GGCGTGGCAA GGGGTGGGCA CATTGGTGTC TGTGCGAAAG GAAAATTGAC CCGGAAGTTC 1771 

CTGTAATAAA TGTCACAATA AAACGAATGA ATGAAAAAAA AAAAAAAAAA A 1822 



30 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



45 Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 
15 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 30 

50 
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Leu Asp Asn Glu Val His Ser Ser Phe He His Arg Arg Leu Arg Ser 
35 AO 45 

Gin Glu Arg Arg Glu Met Gin Arg Glu He Leu Ser He Leu Gly Leu 
5 50 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
65 70 75 80 

10 Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly 

85 90 95 



15 



Gly Pro Gly Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser 
100 105 HO 

Thr Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr 
115 120 125 



Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys 
20 130 135 1*0 

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu 
145 150 155 160 

25 Ser Lys He Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg He 

165 170 175 



30 



Tyr Lys Asp Tyr He Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg He 
180 185 190 

Ser Val Tyr Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu 
195 200 205 



Phe Leu Leu Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu 
35 210 215 220 

Val Phe Asp He Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg 
225 230 235 240 

40 His Asn Leu Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser 

245 250 255 



45 



He Asn Pro Lys Leu Ala Gly Leu He Gly Arg His Gly Pro Gin Asn 
260 265 270 

Lys Gin Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe 
275 280 285 



Arg Ser He Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser 
50 290 295 300 
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Lys Thr Pro Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu 
305 310 315 320 

Asn Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr 
5 325 330 335 

Val Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu 
340 345 350 

10 Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn 
355 360 365 

Ser Tyr Met Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His 
370 375 380 

Phe He Asn Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin 
385 390 395 400 

Leu Asn Ala He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He 
20 405 410 415 

Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 

25 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1873 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

35 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
40 (A) 0RGANISH: MURIDAE 

(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
45 (B) LOCATION: 104.. 1393 

(D) OTHER INFORMATION: /function* "OSTEOGENIC PROTEIN" 
/product- "MOPl" 
/note- "MOPl (CDNA)" 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
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CTGCAGCAAG TGACCTCGGG TCGTGGACCG CTGCCCTGCC CCCTCCGCTG CCACCTGGGG 60 

CGGCGCGGGC CCGGTGCCCC GGATCGCGCG TAGAGCCGGC GCG ATG CAC GTG CGC 115 

Met His Val Arg 
5 1 

TCG CTG CGC GCT GCG GCG CCA CAC AGC TTC GTG GCG CTC TGG GCG CCT 163 
Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala Pro 
5 10 15 20 



10 



30 



CTG TTC TTG CTG CGC TCC GCC CTG GCC GAT TTC AGC CTG GAC AAC GAG 211 
Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn Glu 
25 30 35 



15 GTG CAC TCC AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG CGG 259 
Val His Ser Ser Phe He His Arg Arg Leu Arg Ser Gin Glu Arg Arg 
40 45 50 

GAG ATG CAG CGG GAG ATC CTG TCC ATC TTA GGG TTG CCC CAT CGC CCG 307 
20 Glu Het Gin Arg Glu He Leu Ser He Leu Gly Leu Pro His Arg Pro 
55 60 65 

CGC CCG CAC CTC CAG GGA AAG CAT AAT TCG GCG CCC ATG TTC ATG TTG 355 
Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro Met Phe Met Leu 
25 70 75 80 

GAC CTG TAC AAC GCC ATG GCG GTG GAG GAG AGC GGG CCG GAC GGA CAG 403 
Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Ser Gly Pro Asp Gly Gin 
85 90 95 100 



GGC TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT 451 
Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gin Gly Pro Pro 
105 110 115 



35 TTA GCC AGC CTG CAG GAC AGC CAT TTC CTC ACT GAC GCC GAC ATG GTC 499 
Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp Ala Asp Met Val 
120 125 130 

ATG AGC TTC GTC AAC CTA GTG GAA CAT GAC AAA GAA TTC TTC CAC CCT 547 
40 Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro 
135 140 145 

CGA TAC CAC CAT CGG GAG TTC CGG TTT GAT CTT TCC AAG ATC CCC GAG 595 
Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys He Pro Glu 
45 150 155 160 

GGC GAA CGG GTG ACC GCA GCC GAA TTC AGG ATC TAT AAG GAC TAC ATC 643 
Gly Glu Arg Val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp Tyr He 
165 170 175 180 

50 
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CGG GAG CGA TTT GAC AAC GAG ACC TTC CAG ATC ACA GTC TAT CAG GTG 

Arjt Glu Are Phe Asp Asn Glu Thr Phe Gin He Thr Val Tyr Gin Val 
8 185 190 195 

5 CTC CAG GAG CAC TCA GGC AGG GAG TCG GAC CTC TTC TTG CTG GAC AGC 

Leu Gin Glu His Ser Gly Arg Glu Ser Asp Leu Phe Leu Leu Asp Ser 
200 205 210 

CGC ACC ATC TGG GCT TCT GAG GAG GGC TGG TTG GTG TTT GAT ATC ACA 

10 Arg Thr He Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp He Thr 
215 



25 



35 



40 



45 



220 



225 



GCC ACC AGC AAC CAC TGG GTG GTC AAC CCT CGG CAC AAC CTG GGC TTA 
Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu Gly Leu 
15 230 235 240 

CAG CTC TCT GTG GAG ACC CTG GAT GGG CAG AGC ATC AAC CCC AAG TTG 

Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser He Asn Pro Lys Leu 

245 250 255 260 

20 

GCA GGC CTG ATT GGA CGG CAT GGA CCC CAG AAC AAG CAA CCC TTC ATG 

Ala Gly Leu He Gly Arg His Gly Pro Gin Asn Lys Gin Pro Phe Met 

265 270 275 

GTG GCC TTC TTC AAG GCC ACG GAA GTC CAT CTC CGT AGT ATC CGG TCC 
Val Ala Phe Phe Lys Ala Thr Glu Val His Leu Arg Ser He Arg Ser 
280 285 290 

ACG GGG GGC AAG CAG CGC AGC CAG AAT CGC TCC AAG ACG CCA AAG AAC 
30 Thr Gly Gly Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro Lys Asn 
295 300 305 

CAA GAG GCC CTG AGG ATG GCC AGT GTG GCA GAA AAC AGC AGC AGT GAC 
Gin Glu Ala Leu Arg Met Ala Ser Val Ala Glu Asn Ser Ser Ser Asp 
310 315 320 

CAG AGG CAG GCC TGC AAG AAA CAT GAG CTG TAC GTC AGC TTC CGA GAC 
Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp 
325 330 335 340 

CTT GGC TGG CAG GAC TGG ATC ATT GCA CCT GAA GGC TAT GCT GCC TAC 
Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala Tyr 
345 350 355 

TAC TGT GAG GGA GAG TGC GCC TTC CCT CTG AAC TCC TAC ATG AAC GCC 
Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala 
360 ' 365 370 



691 



739 



787 



835 



883 



931 



979 



1027 



1075 



1123 



1171 



1219 
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ACC AAC CAC GCC ATC GTC CAG ACA CTG GTT CAC TTC ATC AAC CCA GAC 1267 
Thr Asn His Ala lie Val Gin Thr Leu Val His Phe lie Asn Pro Asp 
375 380 385 

5 ACA GTA CCC AAG CCC TGC TGT GCG CCC ACC CAG CTC AAC GCC ATC TCT 1315 
Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He Ser 
390 395 400 

GTC CTC TAC TTC GAC GAC AGC TCT AAT GTC GAC CTG AAG AAG TAC AGA 1363 
10 Val Leu Tyr Phe Asp Asp Ser Ser Asn Val Asp Leu Lys Lys Tyr Arg 
405 410 415 420 

AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCTTCC TGAGACCCTG 1413 
Asn Het Val Val Arg Ala Cys Gly Cys His 
15 425 430 

ACCTTTGCGG GGCCACACCT TTCCAAATCT TCGATGTCTC ACCATCTAAG TCTCTCACTG 1473 

CCCACCTTGG CGAGGAGAAC AGACCAACCT CTCCTGAGCC TTCCCTCACC TCCCAACCGG 1533 

20 

AAGCATGTAA GGGTTCCAGA AACCTGAGCG TGCAGCAGCT GATGAGCGCC CTTTCCTTCT 1593 

GGCACGTGAC GGACAAGATC CTACCAGCTA CCACAGCAAA CGCCTAAGAG CAGGAAAAAT 1653 

25 GTCTGCCAGG AAAGTGTCCA GTGTCCACAT GGCCCCTGGC GCTCTGAGTC TTTGAGGAGT 1713 

AATCGCAAGC CTCGTTCAGC TGCAGCAGAA GGAAGGGCTT AGCCAGGGTG GGCGCTGGCG 1773 

TCTGTGTTGA AGGGAAACCA AGCAGAAGCC ACTGTAATGA TATGTCACAA TAAAACCCAT 1833 

30 ltJ ,., 
GAATGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAGAATTC 18/3 

(2) INFORMATION FOR SEQ ID NO: 19: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

45 Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 
1 5 10 15 



Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 30 



50 
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Leu Asp Asn Glu Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser 
35 40 45 

Gin Glu Arg Arg Glu Het Gin Arg Glu He Leu Ser He Leu Gly Leu 
5 50 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
65 70 75 80 

10 Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Ser Gly 

85 90 95 



15 



Pro Asp Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr 
100 105 HO 

Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp 
115 120 125 



Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu 
20 130 135 140 

Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser 
145 150 155 160 

25 Lys He Pro Glu Gly Glu Arg Val Thr Ala Ala Glu Phe Arg He Tyr 

165 170 175 



30 



Lys Asp Tyr He Arg Glu Arg Phe Asp Asn Glu Thr Phe Gin He Thr 
7 180 185 190 

Val Tyr Gin Val Leu Gin Glu His Ser Gly Arg Glu Ser Asp Leu Phe 
195 200 205 



Leu Leu Asp Ser Arg Thr He Trp Ala Ser Glu Glu Gly Trp Leu Val 
35 210 215 220 

Phe Asp He Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His 
225 230 235 240 

40 Asn Leu Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser He 

245 250 255 



45 



Asn Pro Lys Leu Ala Gly Leu He Gly Arg His Gly Pro Gin Asn Lys 
260 265 270 

Gin Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Leu Arg 
275 280 285 
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Ser He Arg Ser Thr Gly Gly Lys Gin Arg Ser Gin Asn Arg Ser Lys 
290 295 300 

Thr Pro Lys Asn Gin Glu Ala Leu Arg Met Ala Ser Val Ala Glu Asn 
5 305 310 315 320 

Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val 
325 330 335 

10 Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly 
340 345 350 

Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser 
355 360 365 

15 

Tyr Met Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His Phe 
370 375 380 

He Asn Pro Asp Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu 
20 385 390 395 400 

Asn Ala He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val Asp Leu 
405 410 415 

25 Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 20: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: cDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
40 (F) TISSUE TYPE: HIPPOCAMPUS 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 490.. 1696 

45 (D) OTHER INFORMATION: /function= "OSTEOGENIC PROTEIN" 

/product* "hOP2-PP" 
/note= "h0P2 (cDNA)" 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
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GGCGCCGGCA GAGCAGGAGT GGCTGGAGGA GCTGTGGTTG GAGCAGGAGG TGGCACGGCA 


bu 




GGGCTGGAGG GCTCCCTATG AGTGGCGGAG ACGGCCCAGG AGGCGCTGGA GCAACAGCTC 


inn 

120 


5 


CCACACCGCA CCAAGCGGTG GCTGCAGGAG CTCGCCCATC GCCCCTGCGC TGCTCGGACC 


180 




GCGGCCACAG CCGGACTGGC GGGTACGGCG GCGACAGAGG CATTGGCCGA GAGTCCCAGT 


240 


10 


CCGCAGAGTA GCCCCGGCCT CGAGGCGGTG GCGTCCCGGT CCTCTCCGTC CAGGAGCCAG 


300 


GACAGGTGTC GCGCGGCGGG GCTCCAGGGA CCGCGCCTGA GGCCGGCTGC CCGCCCGTCC 


360 




CGCCCCGCCC CGCCGCCCGC CGCCCGCCGA GCCCAGCCTC 


CTTGCCGTCG GGGCGTCCCC 


420 


15 


AGGCCCTGGG TCGGCCGCGG AGCCGATGCG CGCCCGCTGA GCGCCCCAGC TGAGCGCCCC 


480 




CGGCCTGCC ATG ACC GCG CTC CCC GGC CCG CTC TGG CTC CTG GGC CTG 
Met Thr Ala Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu 
1 5 10 


528 


20 


GCG CTA TGC GCG CTG GGC GGG GGC GGC CCC GGC 
Ala Leu Cys Ala Leu Gly Gly Gly Gly Pro Gly 
15 20 


CTG CGA CCC 
Leu Arg Pro 
25 


CCG 
Pro 


CCC 
Pro 


576 


25 


GGC TGT CCC CAG CGA CGT CTG GGC GCG CGC GAG 
Gly Cys Pro Gin Arg Arg Leu Gly Ala Arg Glu 
30 35 40 


CGC CGG GAC 
Arg Arg Asp 


GTG 
Val 


CAG 
Gin 
45 


624 


30 


CGC GAG ATC CTG GCG GTG CTC GGG CTG CCT GGG 
Arg Glu He Leu Ala Val Leu Gly Leu Pro Gly 
50 55 


CGG CCC CGG 
Arg Pro Arg 


CCC 
Pro 
60 


CGC 
Arg 


672 


35 


GCG CCA CCC GCC GCC TCC CGG CTG CCC GCG TCC 
Ala Pro Pro Ala Ala Ser Arg Leu Pro Ala Ser 
65 70 


GCG CCG CTC 
Ala Pro Leu 
75 


TTC 
Pbe 


ATG 
Het 


720 


40 


CTG GAC CTG TAC CAC GCC ATG GCC GGC GAC GAC 
Leu Asp Leu Tyr His Ala Met Ala Gly Asp Asp 
80 85 


GAC GAG GAC 
Asp Glu Asp 
90 


GGC 
Gly 


GCG 
Ala 


. 768 


CCC GCG GAG CGG CGC CTG GGC CGC GCC GAC CTG 
Pro Ala Glu Arg Arg Leu Gly Arg Ala Asp Leu 
95 100 


GTC ATG AGC 
Val Met Ser 
105 


TTC 
Phe 


GTT 
Val 


816 


45 


AAC ATG GTG GAG CGA GAC CGT GCC CTG GGC CAC 
Asn Met Val Glu Arg Asp Arg Ala Leu Gly His 
110 115 120 


CAG GAG CCC 
Gin Glu Pro 


CAT 
His 


TGG 
Trp 
125 


864 
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AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC CCG GCT GGG GAG GCG GTC 
Lys Glu Phe Arg Phe Asp Leu Thr Gin He Pro Ala Gly Glu Ala Val 
130 135 140 

5 ACA GCT GCG GAG TTC CGG ATT TAC AAG GTG CCC AGC ATC CAC CTG CTC 
Thr Ala Ala Glu Phe Arg He Tyr Lys Val Pro Ser He His Leu Leu 
145 150 155 

AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG GTG GTC CAG GAG CAG TCC 
10 Asn Arg Thr Leu His Val Ser Met Phe Gin Val Val Gin Glu Gin Ser 
160 165 170 



15 



20 



25 



AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT CTT CAG ACG CTC CGA GCT 
Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ala 
175 180 185 

GGA GAC GAG GGC TGG CTG GTG CTG GAT GTC ACA GCA GCC AGT GAC TGC 
Gly Asp Glu Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys 
190 195 200 205 

TGG TTG CTG AAG CGT CAC AAG GAC CTG GGA CTC CGC CTC TAT GTG GAG 
Trp Leu Leu Lys Arg His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu 
210 215 220 

ACT GAG GAC GGG CAC AGC GTG GAT CCT GGC CTG GCC GGC CTG CTG GGT 
Thr Glu Asp Gly His Ser Val Asp Pro Gly Leu Ala Gly Leu Leu Gly 
225 230 235 



CAA CGG GCC CCA CGC TCC CAA CAG CCT TTC GTG GTC ACT TTC TTC AGG 
30 Gin Arg Ala Pro Arg Ser Gin Gin Pro Phe Val Val Thr Phe Phe Arg 
240 245 250 

GCC AGT CCG AGT CGC ATC CGC ACC CCT CGG GCA GTG AGG CCA CTG AGG 
Ala Ser Pro Ser Pro He Arg Thr Pro Arg Ala Val Arg Pro Leu Arg 
35 255 260 265 

AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG CCG CAG GCC AAC CGA CTC 
Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu Pro Gin Ala Asn Arg Leu 
270 275 280 285 

40 

CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC CAC GGC CGG CAG GTC TGC 
Pro Gly He Phe Asp Asp Val His Gly Ser His Gly Arg Gin Val Cys 
290 295 300 

45 CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG GAC CTC GGC TGG CTG GAC 
Arg Arg His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu Asp 
305 310 315 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 
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TGG GTC ATC GCT CCC CAA GGC TAC TCG GCC TAT TAC TGT GAG GGG GAG 1488 
Trp Val He Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu 
320 325 330 

5 TGC TCC TTC CCA CTG GAC TCC TGC ATG AAT GCC ACC AAC CAC GCC ATC 1536 
Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala He 
335 340 345 

CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA AAC GCA GTC CCC AAG GCG ( 1584 
10 Leu Gin Ser Leu Val His Leu Met Lys Pro Asn Ala Val Pro Lys Ala 
350 355 360 365 

TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC TCT GTG CTC TAC TAT GAC 1632 
Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp 
15 370 375 380 

AGC AGC AAC AAC GTC ATC CTG CGC AAA GCC CGC AAC ATG GTG GTC AAG 1680 
Ser Ser Asn Asn Val He Leu Arg Lys Ala Arg Asn Met Val Val Lys 
385 390 395 



20 



25 



GCC TGC GGC TGC CAC T GAGTCAGCCC GCCCAGCCCT ACTGCAG 1723 
Ala Cys Gly Cys His 
400 

(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

Met Thr Ala Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu Ala Leu Cys 
15 10 15 

40 Ala Leu Gly Gly Gly Gly Pro Gly Leu Arg Pro Pro Pro Gly Cys Pro 
20 25 30 

Gin Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Val Gin Arg Glu He 
35 40 45 

45 

Leu Ala Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg Ala Pro Pro 
50 55 60 
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Ala Ala Ser Are Leu Pro Ala Ser Ala Pro Leu Phe Met Leu Asp Leu 
65 70 75 80 

Tyr His Ala Met Ala Gly Asp Asp Asp Glu Asp Gly Ala Pro Ala Glu 
5 85 90 95 

Arg Arg Leu Gly Arg Ala Asp Leu Val Met Ser Phe Val Asn Met Val 
100 105 110 

10 Glu Arg Asp Arg Ala Leu Gly His Gin Glu Pro His Trp Lys Glu Phe 
115 120 125 



15 



20 



30 



35 



Arg Phe Asp Leu Thr Gin lie Pro Ala Gly Glu Ala Val Thr Ala Ala 
130 135 1A0 

Glu Phe Arg He Tyr Lys Val Pro Ser He His Leu Leu Asn Arg Thr 
145 150 155 160 

Leu His Val Ser Met Phe Gin Val Val Gin Glu Gin Ser Asn Arg Glu 
165 170 175 

Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ala Gly Asp Glu 
180 185 190 



25 Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys Trp Leu Leu 
195 200 205 



Lys Arg His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Glu Asp 
210 215 220 

Gly His Ser Val Asp Pro Gly Leu Ala Gly Leu Leu Gly Gin Arg Ala 
225 230 235 240 

Pro Arg Ser Gin Gin Pro Phe Val Val Thr Phe Phe Arg Ala Ser Pro 
245 250 255 

Ser Pro He Arg Thr Pro Arg Ala Val Arg Pro Leu Arg Arg Arg Gin 
260 265 270 



40 Pro Lys Lys Ser Asn Glu Leu Pro Gin Ala Asn Arg Leu Pro Gly He 
275 280 285 



45 



50 



Phe Asp Asp Val His Gly Ser His Gly Arg Gin Val Cys Arg Arg His 
290 295 300 

Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu Asp Trp Val He 
305 310- 315 320 

Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ser Phe 
325 330 335 
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Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala lie Leu Gin Ser 
3A0 345 350 

Leu Val His Leu Met Lys Pro Asn Ala Val Pro Lys Ala Cys Cys Ala 
5 355 360 365 

Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn 
370 375 380 

10 Asn Val He Leu Arg Lys Ala Arg Asn Met Val Val Lys Ala Cys Gly 
385 390 395 400 



15 



Cys His 



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1926 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: HURIDAE 

(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
30 (B) LOCATION: 93.. 1289 

(D) OTHER INFORMATION: /function- "OSTEOGENIC PROTEIN" 
/product- "mOP2-PP" 
/note- "mOP2 cDNA" 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GCCAGGCACA GGTGCGCCGT CTGGTCCTCC CCGTCTGGCG TCAGCCGAGC CCGACCAGCT 60 

40 ACCAGTGGAT GCGCGCCGGC TGAAAGTCCG AG ATG GCT ATG CGT CCC GGG CCA 113 

Met Ala Met Arg Pro Gly Pro 
1 5 

CTC TGG CTA TTG GGC CTT GCT CTG TGC GCG CTG GGA GGC GGC CAC GGT 161 
45 Leu Trp Leu Leu Gly Leu Ala Leu Cys Ala Leu Gly Gly Gly His Gly 
10 15 20 

CCG CGT CCC CCG CAC ACC TGT CCC CAG CGT CGC CTG GGA GCG CGC GAG 209 
Pro Arg Pro Pro His Thr Cys Pro Gin Arg Arg Leu Gly Ala Arg Glu 
50 25 30 35 
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CGC CGC GAC ATG CAG CGT GAA ATC CTG GCG GTG CTC GGG CTA CCG GGA 
Are Are Asp Met Gin Are Glu lie Leu Ala Val Leu Gly Leu Pro Gly 
40 45 50 55 

5 CGG CCC CGA CCC CGT GCA CAA CCC GCC GCT GCC CGG CAG CCA GCG TCC 
Are Pro Are Pro Are Ala Gin Pro Ala Ala Ala Arg Gin Pro Ala Ser 
60 65 70 

GCG CCC CTC TTC ATG TTG GAC CTA TAC CAC GCC ATG ACC GAT GAC GAC 
10 Ala Pro Leu Phe Met Leu Asp Leu Tyr His Ala Met Thr Asp Asp Asp 
75 80 85 

GAC GGC GGG CCA CCA CAG GCT CAC TTA GGC CGT GCC GAC CTG GTC ATG 
Asp Gly Gly Pro Pro Gin Ala His Leu Gly Arg Ala Asp Leu Val Met 
15 90 95 100 

AGC TTC GTC AAC ATG GTG GAA CGC GAC CGT ACC CTG GGC TAC CAG GAG 
Ser Phe Val Asn Met Val Glu Arg Asp Arg Thr Leu Gly Tyr Gin Glu 
105 110 115 



20 



25 



30 



35 



40 



45 



CCA CAC TGG AAG GAA TTC CAC TTT GAC CTA ACC CAG ATC CCT GCT GGG 
Pro His Trp Lys Glu Phe His Phe Asp Leu Thr Gin He Pro Ala Gly 
120 125 130 135 

GAG GCT GTC ACA GCT GCT GAG TTC CGG ATC TAC AAA GAA CCC AGC ACC 
Glu Ala Val Thr Ala Ala Glu Phe Arg He Tyr Lys Glu Pro Ser Thr 
140 145 150 

CAC CCG CTC AAC ACA ACC CTC CAC ATC AGC ATG TTC GAA GTG GTC CAA 
His Pro Leu Asn Thr Thr Leu His He Ser Met Phe Glu Val Val Gin 
155 160 165 

GAG CAC TCC AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT CTT CAG ACG 
Glu His Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr 
170 175 180 

CTC CGA TCT GGG GAC GAG GGC TGG CTG GTG CTG GAC ATC ACA GCA GCC 
Leu Arg Ser Gly Asp Glu Gly Trp Leu Val Leu Asp He Thr Ala Ala 
185 190 195 

AGT GAC CGA TGG CTG CTG AAC CAT CAC AAG GAC CTG GGA CTC CGC CTC 
Ser Asp Arg Trp Leu Leu Asn His His Lys Asp Leu Gly Leu Arg Leu 
200 205 210 215 

TAT GTG GAA ACC GCG GAT GGG CAC AGC ATG GAT CCT GGC CTG GCT GGT 
Tyr Val Glu Thr Ala Asp Gly His Ser Met Asp Pro Gly Leu Ala Gly 
220 225 230. 



257 



305 



353 



401 



449 



497 



545 



593 



641 



689 



737 



785 
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CTG CTT GGA CGA CAA GCA CCA CGC TCC AGA CAG CCT TTC ATG GTA ACC 833 
Leu Leu Gly Arg Gin Ala Pro Arg Ser Arg Gin Pro Phe Met Val Thr 
235 240 245 

5 TTC TTC AGG GCC AGC CAG AGT CCT GTG CGG GCC CCT CGG GCA GCG AGA 881 
Phe Phe Arg Ala Ser Gin Ser Pro Val Arg Ala Pro Arg Ala Ala Arg 
250 255 260 

CCA CTG AAG AGG AGG CAG CCA AAG AAA ACG AAC GAG CTT CCG CAC CCC 929 
10 Pro Leu Lys Arg Arg Gin Pro Lys Lys Thr Asn Glu Leu Pro His Pro 
265 270 275 

AAC AAA CTC CCA GGG ATC TTT GAT GAT GGC CAC GGT TCC CGC GGC AGA 977 
Asn Lys Leu Pro Gly He Phe Asp Asp Gly His Gly Ser Arg Gly Arg 
15 280 285 290 295 

GAG GTT TGC CGC AGG CAT GAG CTC TAC GTC AGC TTC CGT GAC CTT GGC 1025 
Glu Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly 
300 305 310 

20 

TGG CTG GAC TGG GTC ATC GCC CCC CAG GGC TAC TCT GCC TAT TAC TGT 1073 
Trp Leu Asp Trp Val He Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys 
315 320 325 

25 GAG GGG GAG TGT GCT TTC CCA CTG GAC TCC TGT ATG AAC GCC ACC AAC 1121 
Glu Gly Glu Cys Ala Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn 
330 335 340 

CAT GCC ATC TTG CAG TCT CTG GTG CAC CTG ATG AAG CCA GAT GTT GTC 1169 
30 His Ala He Leu Gin Ser Leu Val His Leu Met Lys Pro Asp Val Val 
345 350 355 

CCC AAG GCA TGC TGT GCA CCC ACC AAA CTG AGT GCC ACC TCT GTG CTG 1217 
Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val Leu 
35 360 365 370 375 

TAC TAT GAC AGC AGC AAC AAT GTC ATC CTG CGT AAA CAC CGT AAC ATG 1265 
Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg Lys His Arg Asn Met 
7 380 385 390 

40 

GTG GTC AAG GCC TGT GGC TGC CAC TGAGGCCCCG CCCAGCATCC TGCTTCTACT 1319 
Val Val Lys Ala Cys Gly Cys His 
395 

45 ACCTTACCAT CTGGCCGGGC CCCTCTCCAG AGGCAGAAAC CCTTCTATGT TATCATAGCT 1379 
CAGACAGGGG CAATGGGAGG CCCTTCACTT CCCCTGGCCA CTTCCTGCTA AAATTCTGGT 1439 
CTTTCCCAGT TCCTCTGTCC TTCATGGGGT TTCGGGGCTA TCACCCCGCC CTCTCCATCC 1499 

50 
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TCCTACCCCA AGCATAGACT GAATGCACAC AGCATCCCAG AGCTATGCTA ACTGAGAGGT 1559 

CTGGGGTCAG CACTGAAGGC CCACATGAGG AAGACTGATC CTTGGCCATC CTCAGCCCAC 1619 

AATGGCAAAT TCTGGATGGT CTAAGAAGGC CCTGGAATTC TAAACTAGAT GATCTGGGCT 1679 

CTCTGCACCA TTCATTGTGG CAGTTGGGAC ATTTTTAGGT ATAACAGACA CATACACTTA 1739 

GATCAATGCA TCGCTGTACT CCTTGAAATC AGAGCTAGCT TGTTAGAAAA AGAATCAGAG 1799 

CCAGGTATAG CGGTGCATGT CATTAATCCC AGCGCTAAAG AGACAGAGAC AGGAGAATCT 1859 

CTGTGAGTTC AAGGCCACAT AGAAAGAGCC TGTCTCGGGA GCAGGAAAAA AAAAAAAAAC 1919 

15 GGAATTC 1926 

(2) INFORMATION FOR SEQ ID N0:23: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:23: 

Met Ala Met Arg Pro Gly Pro Leu Trp Leu Leu Gly Leu Ala Leu Cys 
30 1 5 10 15 

Ala Leu Gly Gly Gly His Gly Pro Arg Pro Pro His Thr Cys Pro Gin 
20 25 30 

35 Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Met Gin Arg Glu He Leu 
35 40 45 



40 



Ala Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg Ala Gin Pro Ala 
50 55 60 

Ala Ala Arg Gin Pro Ala Ser Ala Pro Leu Phe Met Leu Asp Leu Tyr 
65 70 75 80 



His Ala Met Thr Asp Asp Asp Asp Gly Gly Pro Pro Gin Ala His Leu 
45 85 90 95 

Gly Arg Ala Asp Leu Val Met Ser Phe Val Asn Met Val Glu Arg Asp 
100 105 110 
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Are Thr Leu Gly Tyr Gin Glu Pro His Trp Lys Glu Phe His Phe Asp 
115 120 125 

Leu Thr Gin He Pro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg 
5 130 135 140 

lie Tyr Lys Glu Pro Ser Thr His Pro Leu Asn Thr Thr Leu His lie 



145 



150 



155 



10 Ser Met Phe Glu Val Val Gin Glu His Ser Asn Arg Glu Ser Asp Leu 

165 170 175 



15 



Phe Phe Leu Asp Leu Gin Thr Leu Arg Ser Gly Asp Glu Gly Trp Leu 
180 185 190 

Val Leu Asp He Thr Ala Ala Ser Asp Arg Trp Leu Leu Asn His His 
195 200 205 



Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Ala Asp Gly His Ser 
20 210 215 220 

Met Asp Pro Gly Leu Ala Gly Leu Leu Gly Arg Gin Ala Pro Arg Ser 
225 230 235 240 

25 Arg Gin Pro Phe Met Val Thr Phe Phe Arg Ala Ser Gin Ser Pro Val 

245 250 255 



30 



Arg Ala Pro Arg Ala Ala Arg Pro Leu Lys Arg Arg Gin Pro Lys Lys 
260 265 270 

Thr Asn Glu Leu Pro His Pro Asn Lys Leu Pro Gly He Phe Asp Asp 
275 280 285 



Gly His Gly Ser Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr 
35 290 295 300 

Val Ser Phe Arg Asp Leu Gly Trp Leu Asp Trp Val He Ala Pro Gin 
305 310 315 320 

40 Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asp 

325 330 335 



Ser Cys Met Asn Ala Thr Asn His Ala He Leu Gin Ser Leu Val His 
340 345 350 



45 
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Leu Het Lys Pro Asp Val Val Pro Lys Ala Cys Cys Ala Pro Thr Lys 
355 360 365 

Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val He 
5 370 375 380 

Leu Arg Lys His Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
385 390 395 

10 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1368 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

20 

(ix) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: 1. .1368 



25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

ATG TCG GGA CTG CGA AAC ACC TCG GAG GCC GTT GCA GTG CTC GCC TCC 48 
Met Ser Gly Leu Arg Asn Thr Ser Glu Ala Val Ala Val Leu Ala Ser 
30 1 5 10 15 

CTG GGA CTC GGA ATG GTT CTG CTC ATG TTC GTG GCG ACC ACG CCG CCG 96 
Leu Gly Leu Gly Met Val Leu Leu Met Phe Val Ala Thr Thr Pro Pro 
20 25 30 

35 

GCC GTT GAG GCC ACC CAG TCG GGG ATT TAC ATA GAC AAC GGC AAG GAC 144 
Ala Val Glu Ala Thr Gin Ser Gly He Tyr He Asp Asn Gly Lys Asp 
35 40 45 



40 CAG ACG ATC ATG CAC AGA GTG CTG AGC GAG GAC GAC AAG CTG GAC GTC 192 
Gin Thr He Met His Arg Val Leu Ser Glu Asp Asp Lys Leu Asp Val 
50 55 60 



TCG TAC GAG ATC CTC GAG TTC CTG GGC ATC GCC GAA CGG CCG ACG CAC 240 
45 Ser Tyr Glu He Leu Glu Phe Leu Gly He Ala Glu Arg Pro Thr His 
65 70 75 80 
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CTG AGC AGC CAC CAG TTG TCG CTG AGG AAG TCG GCT CCC AAG TTC CTG 
Leu Ser Ser His Gin Leu Ser Leu Arg Lys Ser Ala Pro Lys Phe Leu 
85 90 95 



20 



288 



5 CTG GAC GTC TAC CAC CGC ATC ACG GCG GAG GAG GGT CTC AGC GAT CAG 336 
Leu Asp Val Tyr His Arg lie Thr Ala Glu Glu Gly Leu Ser Asp Gin 
100 105 110 

GAT GAG GAC GAC GAC TAC GAA CGC GGC CAT CGG TCC AGG AGG AGC GCC 384 
10 Asp Glu Asp Asp Asp Tyr Glu Arg Gly His Arg Ser Arg Arg Ser Ala 
115 120 125 

GAC CTC GAG GAG GAT GAG GGC GAG CAG CAG AAG AAC TTC ATC ACC GAC 432 
Asp Leu Glu Glu Asp Glu Gly Glu Gin Gin Lys Asn Phe He Thr Asp 
15 130 135 140 

CTG GAC AAG CGG GCC ATC GAC GAG AGC GAC ATC ATC ATG ACC TTC CTG 480 
Leu Asp Lys Arg Ala He Asp Glu Ser Asp He He Met Thr Phe Leu 
145 150 155 160 

AAC AAG CGC CAC CAC AAT GTG GAC GAA CTG CGT CAC GAG CAC GGC CGT 528 
Asn Lys Arg His His Asn Val Asp Glu Leu Arg His Glu His Gly Arg 
165 170 175 



25 CGC CTG TGG TTC GAC GTC TCC AAC GTG CCC AAC GAC AAC TAC CTG GTG 576 
Arg Leu Trp Phe Asp Val Ser Asn Val Pro Asn Asp Asn Tyr Leu Val 
180 185 190 

ATG GCC GAG CTG CGC ATC TAT CAG AAC GCC AAC GAG GGC AAG TGG CTG 624 
30 Met Ala Glu Leu Arg He Tyr Gin Asn Ala Asn Glu Gly Lys Trp Leu 
195 200 205 

ACC GCC AAC AGG GAG TTC ACC ATC ACG GTA TAC GCC ATT GGC ACC GGC 672 
Thr Ala Asn Arg Glu Phe Thr He Thr Val Tyr Ala He Gly Thr Gly 
35 210 215 220 

ACG CTG GGC CAG CAC ACC ATG GAG CCG CTG TCC TCG GTG AAC ACC ACC 720 
Thr Leu Gly Gin His Thr Het Glu Pro Leu Ser Ser Val Asn Thr Thr 
225 230 235 240 

40 

GGG GAC TAC GTG GGC TGG TTG GAG CTC AAC GTG ACC GAG GGC CTG CAC 768 
Glv Asd Tyr Val Gly Trp Leu Glu Leu Asn Val Thr Glu Gly Leu His 
245 250 255 

45 GAG TGG CTG GTC AAG TCG AAG GAC AAT CAT GGC ATC TAC ATT GGA GCA 816 
Glu Trp Leu Val Lys Ser Lys Asp Asn His Gly He Tyr He Gly Ala 
260 265 270 
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CAC GCT GTC AAC CGA CCC GAC CGC GAG GTG AAG CTG GAC GAC ATT GGA 864 
His Ala Val Asn Arg Pro Asp Arg Glu Val Lys Leu Asp Asp lie Gly 
275 280 285 

5 CTG ATC CAC CGC AAG GTG GAC GAC GAG TTC CAG CCC TTC ATG ATC GGC 912 
Leu He His Arg Lys Val Asp Asp Glu Phe Gin Pro Phe Met He Gly 
290 295 300 

TTC TTC CGC GGA CCG GAG CTG ATC AAG GCG ACG GCC CAC AGC AGC CAC 960 
10 Phe Phe Arg Gly Pro Glu Leu He Lys Ala Thr Ala His Ser Ser His 
305 310 315 320 

CAC AGG AGC AAG CGA AGC GCC AGC CAT CCA CGC AAG CGC AAG AAG TCG 1008 
His Arg Ser Lys Arg Ser Ala Ser His Pro Arg Lys Arg Lys Lys Ser 
15 325 330 335 

GTG TCG CCC AAC AAC GTG CCG CTG CTG GAA CCG ATG GAG AGC ACG CGC 1056 

Val Ser Pro Asn Asn Val Pro Leu Leu Glu Pro Met Glu Ser Thr Arg 

340 345 350 

20 

AGC TGC CAG ATG CAG ACC CTG TAC ATA GAC TTC AAG GAT CTG GGC TGG 1104 

Ser Cys Gin Het Gin Thr Leu Tyr He Asp Phe Lys Asp Leu Gly Trp 
355 360 365 

25 CAT GAC TGG ATC ATC GCA CCA GAG GGC TAT GGC GCC TTC TAC TGC AGC 1152 
His Asp Trp He He Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser 
370 375 380 

GGC GAG TGC AAT TTC CCG CTC AAT GCG CAC ATG AAC GCC ACG AAC CAT 1200 
30 Gly Glu Cys Asn Phe Pro Leu Asn Ala His Het Asn Ala Thr Asn His 
385 390 395 400 

GCG ATC GTC CAG ACC CTG GTC CAC CTG CTG GAG CCC AAG AAG GTG CCC 1248 
Ala He Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro 
35 405 410 415 

AAG CCC TGC TGC GCT CCG ACC AGG CTG GGA GCA CTA CCC GTT CTG TAC 1296 
Lys Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr 
420 425 430 



40 



CAC CTG AAC GAC GAG AAT GTG AAC CTG AAA AAG TAT AGA AAC ATG ATT 1344 
His Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Het He 
435 440 445 



45 GTG AAA TCC TGC GGG TGC CAT TGA 1368 
Val Lys Ser Cys Gly Cys His 
450 455 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

10 (xl) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Ser Gly Leu Are Asn Thr Ser Glu Ala Val Ala Val Leu Ala Ser 
15 10 15 

15 Leu Gly Leu Gly Met Val Leu Leu Met Phe Val Ala Thr Thr Pro Pro 
20 25 30 

Ala Val Glu Ala Thr Gin Ser Gly He Tyr He Asp Asn Gly Lys Asp 
35 40 45 

20 

Gin Thr He Met His Arg Val Leu Ser Glu Asp Asp Lys Leu Asp Val 
50 55 60 

Ser Tyr Glu He Leu Glu Phe Leu Gly He Ala Glu Arg Pro Thr His 
25 65 70 75 80 

Leu Ser Ser His Gin Leu Ser Leu Arg Lys Ser Ala Pro Lys Phe Leu 
85 90 95 

30 Leu Asp Val Tyr His Arg He Thr Ala Glu Glu Gly Leu Ser Asp Gin 
100 105 HO 

Asp Glu Asp Asp Asp Tyr Glu Arg Gly His Arg Ser Arg Arg Ser Ala 
115 120 125 

35 

Asp Leu Glu Glu Asp Glu Gly Glu Gin Gin Lys Asn Phe He Thr Asp 
130 135 140 

Leu Asp Lys Arg Ala He Asp Glu Ser Asp He He Met Thr Phe Leu 
40 145 150 155 160 

Asn Lys Arg His His Asn Val Asp Glu Leu Arg His Glu His Gly Arg 
165 170 175 

45 Arg Leu Trp Phe Asp Val Ser Asn Val Pro Asn Asp Asn Tyr Leu Val 
180 185 190 

Met Ala Glu Leu Arg He Tyr Gin Asn Ala Asn Glu Gly Lys Trp Leu 
195 200 205 

50 



WO 94/03075 



PCT/US93/07190 



- 133 - 



Thr Ala Asn Arg Glu Phe Thr He Thr Val Tyr Ala He Gly Thr Gly 
210 215 220 

Thr Leu Gly Gin His Thr Met Glu Pro Leu Ser Ser Val Asn Thr Thr 
5 225 230 235 240 

Gly Asp Tyr Val Gly Trp Leu Glu Leu Asn Val Thr Glu Gly Leu His 
245 250 255 

10 Glu Trp Leu Val Lys Ser Lys Asp Asn His Gly He Tyr He Gly Ala 
260 265 270 



15 



30 



His Ala Val Asn Arg Pro Asp Arg Glu Val Lys Leu Asp Asp He Gly 
275 280 285 

Leu He His Arg Lys Val Asp Asp Glu Phe Gin Pro Phe Met He Gly 
290 295 300 



Phe Phe Arg Gly Pro Glu Leu He Lys Ala Thr Ala His Ser Ser His 
20 305 310 315 320 

His Arg Ser Lys Arg Ser Ala Ser His Pro Arg Lys Arg Lys Lys Ser 
325 330 335 

25 Val Ser Pro Asn Asn Val Pro Leu Leu Glu Pro Met Glu Ser Thr Arg 
340 345 350 



Ser Cys Gin Met Gin Thr Leu Tyr lie Asp Phe Lys Asp Leu Gly Trp 
355 360 365 

His Asp Trp He He Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser 
370 375 380 



Gly Glu Cys Asn Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His 
35 385 390 395 400 

Ala He Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro 
405 410 415 

40 Lys Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr 
420 425 430 



45 



His Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Met He 
435 440 445 

Val Lys Ser Cys Gly Cys His 
450 455 
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(2) INFORMATION FOR SEQ ID N0:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

10 

(ix) FEATURE: 

(A) NAME /KEY: Protein 

(B) LOCATION: 1..104 

15 (D) OTHER INFORMATION: /note= "BMP3 n 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

20 Cys Ala Arg Arg Tyr Leu Lys Val Asp Phe Ala Asp He Gly Trp Ser 

1 5 10 15 

Glu Trp He He Ser Pro Lys Ser Phe Asp Ala Tyr Tyr Cys Ser Gly 
20 25 30 

25 

Ala Cys Gin Phe Pro Met Pro Lys Ser Leu Lys Pro Ser Asn His Ala 
35 40 45 

Thr He Gin Ser He Val Ala Arg Ala Val Gly Val Val Pro Gly He 
30 50 55 60 

Pro Glu Pro Cys Cys Val Pro Glu Lys Met Ser Ser Leu Ser He Leu 
65 70 75 80 

35 Phe Phe Asp Glu Asn Lys Asn Val Val Leu Lys Val Tyr Pro Asn Met 

85 90 95 

Thr Val Glu Ser Cys Ala Cys Arg 
100 

40 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 



WO 94/03075 



PCT/US93/07190 



- 135 - 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(ix) FEATURE: 
5 (A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note= "BHP5" 



10 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Cys Lys Lys His Glu Leu Tyr Val Ser Pbe Arg Asp Leu Gly Trp Gin 
15 10 15 

. 15 Asp Trp lie lie Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly 

20 25 30 



20 



Glu Cys Ser Phe Pro Leu Asn Ala His Het Asn Ala Thr Asn His Ala 
35 40 45 

He Val Gin Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys 
50 55 60 



Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr Phe 
25 65 70 75 80 

Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg Asn Het Val Val 
85 90 95 

30 Arg Ser Cys Gly Cys His 

100 

(2) INFORMATION FOR SEQ ID NO: 28: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



45 



50 



(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note= "BMP 6" 



WO 94/03075 



PCT/US93/07190 



- 136 - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Cys Arg Lys His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Gin 
15 10 15 

5 

Asp Trp He He Ala Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
10 35 40 45 

He Val Gin Thr Leu Val His Leu Met Asn Pro Glu Tyr Val Pro Lys 
50 55 60 

15 Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr Phe 

65 70 75 80 



20 



Asp Asp Asn Ser Asn Val He Leu Lys Lys Tyr Arg Trp Met Val Val 
85 90 95 

Arg Ala Cys Gly Cys His 
100 



(2) INFORMATION FOR SEQ ID NO: 29: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 
35 (A) NAME/KEY: Protein 

(B) LOCATION: 1..102 
(D) OTHER INFORMATION: /label" OPX 

/note- "WHEREIN EACH ZAA IS INDEPENDENTLY SELECTED 
FROM A GROUP OF ONE OR MORE SPECIFIED AMINO ACIDS 
40 AS DEFINED IN THE SPECIFICATION (SECTION II.B.2.)" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:29: 

45 Cys Xaa Xaa His Glu Leu Tyr Val Xaa Phe Xaa Asp Leu Gly Trp Xaa 

15 10 15 

Asp Trp Xaa He Ala Pro Xaa Gly Tyr Xaa Ala Tyr Tyr Cys Glu Gly 
20 25 30 

50 
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Glu Cys Xaa Phe Pro Leu Xaa Ser Xaa Met Asn Ala Thr Asn His Ala 
35 40 45 

He Xaa Gin Xaa Leu Val His Xaa Xaa Xaa Pro Xaa Xaa Val Pro Lys 
5 50 55 60 

Xaa Cys Cys Ala Pro Thr Xaa Leu Xaa Ala Xaa Ser Val Leu Tyr Xaa 
65 70 75 80 

10 Asp Xaa Ser Xaa Asn Val Xaa Leu Xaa Lys Xaa Arg Asn Met Val Val 

85 90 95 



15 



Xaa Ala Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 97 amino acids 
20 (B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..97 

30 (D) OTHER INFORMATION: /label- GENERIC-SEQ5 

/note- "WHEREIN EACH XAA IS INDEPENDENTLY SELECTED 
FROM A GROUP OF ONE OR MORE SPECIFIED AMINO ACIDS 
AS DEFINED IN THE SPECIFICATION. " 



35 



40 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Leu Xaa Xaa Xaa Phe Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa Xaa Xaa 
15 10 15 

Pro Xaa Xaa Xaa Xaa Ala Xaa Tyr Cys Xaa Gly Xaa Cys Xaa Xaa Pro 
20 25 30 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn His Ala Xaa Xaa Xaa Xaa Xaa 
45 35 40 45 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Cys Xaa Pro 
50 55 60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

Val Xaa Leu Xaa Xaa Xaa Xaa Xaa Met Xaa Val Xaa Xaa Cys Xaa Cys 
85 90 95 



10 Xaa 



(2) INFORMATION FOR SEQ ID NO: 31: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



40 



(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 
25 (A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /label= GENERIC-SEQ6 

/note= "WHEREIN EACH XAA IS INDEPENDENTLY SELECTED 
FROM A GROUP OF ONE OR MORE SPECIFIED AMINO ACIDS 
30 AS DEFINED IN THE SPECIFICATION. " 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31: 

35 Cys Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Phe Xaa Xaa Xaa Gly Trp Xaa 

1 5 10 15 



Xaa Trp Xaa Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala Xaa Tyr Cys Xaa Gly 
20 25 30 

Xaa Cys Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn His Ala 
35 40 A5 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
45 50 55 60 
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laa Cys Cys Xaa Pro laa Zaa Zaa Zaa Zaa Zaa Zaa Zaa Leu Zaa Zaa 
65 70 75 80 

Zaa Zaa Zaa Zaa Zaa Val Zaa Leu Zaa Zaa Zaa Zaa Zaa Met Zaa Val 
5 85 90 95 

Zaa Zaa Cys Xaa Cys Zaa 
100 

10 (2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1247 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISH: HOMO SAPIENS 
(F) TISSUE TYPE: BRAIN 

(ix) FEATURE: 
25 (A) NAME/KEY: CDS 

(B) LOCATION: .84.. 1199 

(D) OTHER INFORMATION: /product- "GDF-1" 
/note= "GDF-1 CDNA" 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GGGGACACCG GCCCCGCCCT CAGCCCACTG GTCCCGGGCC GCCGCGGACC CTGCGCACTC 60 



35 TCTGGTCATC GCCTGGGAGG AAG ATG CCA CCG CCG CAG CAA GGT CCC TGC 110 

Met Pro Pro Pro Gin Gin Gly Pro Cys 
1 5 



45 



50 



GGC CAC CAC CTC CTC 
Gly His His Leu Leu 
10 


CTC 
Leu 
15 


CTC 
Leu 


CTG 
Leu 


GCC 
Ala 


CTG 
Leu 


CTG 
Leu 
20 


CTG 
Leu 


CCC 
Pro 


TCG 
Ser 


CTG 
Leu 


CCC 
Pro 
25 


158 


CTG ACC CGC GCC CCC 
Leu Thr Arg Ala Pro 
30 


GTG 
Val 


CCC 
Pro 


CCA 
Pro 


GGC 
Gly 


CCA 
Pro 
35 


GCC 
Ala 


GCC 
Ala 


GCC 
Ala 


CTG 
Leu 


CTC 
Leu 
40 


CAG 
Gin 


206 


GCT CTA GGA CTG CGC 
Ala Leu Gly Leu Arg 
45 


GAT 
Asp 


GAG 
Glu 


CCC 
Pro 


CAG 
Gin 
50 


GGT 
Gly 


GCC 
Ala 


CCC 
Pro 


AGG 
Arg 


CTC 
Leu 
55 


CGG 
Arg 


CCG 
Pro 


254 



WO 94/03075 



PCT/US93/07190 



- 140 - 



GTT CCC CCG GTC ATG TGG CGC CTG TTT CGA CGC CGG GAC CCC CAG GAG 
Val Pro Pro Val Met Trp Arg Leu Phe Arg Arg Arg Asp Pro Gin Glu 
60 65 70 

5 ACC AGG TCT GGC TCG CGG CGG ACG TCC CCA GGG GTC ACC CTG CAA CCG 
Thr Arg Ser Gly Ser Arg Arg Thr Ser Pro Gly Val Thr Leu Gin Pro 
75 80 85 

TGC CAC GTG GAG GAG CTG GGG GTC GCC GGA AAC ATC GTG CGC CAC ATC 
10 Cys His Val Glu Glu Leu Gly Val Ala Gly Asn He Val Arg His He 
90 95 100 105 

CCG GAC CGC GGT GCG CCC ACC CGG GCC TCG GAG CCT GTC TCG GCC GCG 
Pro Asp Arg Gly Ala Pro Thr Arg Ala Ser Glu Pro Val Ser Ala Ala 
110 115 120 

GGG CAT TGC CCT GAG TGG ACA GTC GTC TTC GAC CTG TCG GCT GTG GAA 
Gly His Cys Pro Glu Trp Thr Val Val Phe Asp Leu Ser Ala Val Glu 
125 130 135 

CCC GCT GAG CGC CCG AGC CGG GCC CGC CTG GAG CTG CGT TTC GCG GCG 
Pro Ala Glu. Arg Pro Ser Arg Ala Arg Leu Glu Leu Arg Phe Ala Ala 
140 145 150 



15 



20 



25 GCG GCG GCG GCA GCC CCG GAG GGC GGC TGG GAG CTG AGC GTG GCG CAA 
Ala Ala Ala Ala Ala Pro Glu Gly Gly Trp Glu Leu Ser Val Ala Gin 
155 160 165 

GCG GGC CAG GGC GCG GGC GCG GAC CCC GGG CCG GTG CTG CTC CGC CAG 
30 Ala Gly Gin Gly Ala Gly Ala Asp Pro Gly Pro Val Leu Leu Arg Gin 
170 175 180 185 

TTG GTG CCC GCC CTG GGG CCG CCA GTG CGC GCG GAG CTG CTG GGC GCC 
Leu Val Pro Ala Leu Gly Pro Pro Val Arg Ala Glu Leu Leu Gly Ala 
35 190 195 200 

GCT TGG GCT CGC AAC GCC TCA TGG CCG CGC AGC CTC CGC CTG GCG CTG 
Ala Trp Ala Arg Asn Ala Ser Trp Pro Arg Ser Leu Arg Leu Ala Leu 
205 210 215 



40 



45 



GCG CTA CGC CCC CGG GCC CCT GCC GCC TGC GCG CGC CTG GCC GAG GCC 
Ala Leu Arg Pro Arg Ala Pro Ala Ala Cys Ala Arg Leu Ala Glu Ala 
220 225 230 

TCG CTG CTG CTG GTG ACC CTC GAC CCG CGC CTG TGC CAC CCC CTG GCC 
Ser Leu Leu Leu Val Thr Leu Asp Pro Arg Leu Cys His Pro Leu Ala 
235 240 245 



302 



350 



398 



446 



494 



542 



590 



638 



686 



734 



782 



830 



WO 94/03075 



PCT/US93/07190 



- 141 - 



CGG CCG CGG CGC GAC GCC GAA CCC GTG TTG GGC GGC GGC CCC GGG GGC 
Arg Pro Arg Arg Asp Ala Glu Pro Val Leu Gly Gly Gly Pro Gly Gly 
250 255 260 265 

5 GCT TGT CGC GCG CGG CGG CTG TAC GTG AGC TTC CGC GAG GTG GGC TGG 
Ala Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly Trp 
270 275 280 

CAC CGC TGG GTC ATC GCG CCG CGC GGC TTC CTG GCC AAC TAC TGC CAG 
10 His Arg Trp Val lie Ala Pro Arg Gly Phe Leu Ala Asn Tyr Cys Gin 
285 290 295 

GGT CAG TGC GCG CTG CCC GTC GCG CTG TCG GGG TCC GGG GGG CCG CCG 
Gly Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly Gly Pro Pro 
15 300 305 310 

GCG CTC AAC CAC GCT GTG CTG CGC GCG CTC ATG CAC GCG GCC GCC CCG 
Ala Leu Asn His Ala Val Leu Arg Ala Leu Met His Ala Ala Ala Pro 
315 320 325 



20 



35 



GGA GCC GCC GAC CTG CCC TGC TGC GTG CCC GCG CGC CTG TCG CCC ATC 
Gly Ala Ala Asp Leu Pro Cys Cys Val Pro Ala Arg Leu Ser Pro He 
330 335 340 345 



25 TCC GTG CTC TTC TXT GAC AAC AGC GAC AAC GTG GTG CTG CGG CAG TAT 
Ser Val Leu Phe Phe Asp Asn Ser Asp Asn Val Val Leu Arg Gin Tyr 
350 355 360 

GAG GAC ATG GTG GTG GAC GAG TGC GGC TGC CGC TAACCCGGGG CGGGCAGGGA 
30 Glu Asp Met Val Val Asp Glu Cys Gly Cys Arg 
365 370 



CCCGGGCCCA ACAATAAATG CCGCGTGG 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:33: 

Met Pro Pro Pro Gin Gin Gly Pro Cys Gly His His Leu Leu Leu Leu 
1 5 10 15 
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Leu Ala Leu Leu Leu Pro Ser Leu Pro Leu Thr Arg Ala Pro Val Pro 
20 25 30 

Pro Gly Pro Ala Ala Ala Leu Leu Gin Ala Leu Gly Leu Arg Asp Glu 
5 35 *0 . 45 

Pro Gin Gly Ala Pro Arg Leu Arg Pro Val Pro Pro Val Met Trp Arg 
50 55 60 

10 Leu Phe Arg Arg Arg Asp Pro Gin Glu Thr Arg Ser Gly Ser Arg Arg 
65 70 75 80 



Thr Ser Pro Gly Val Thr Leu Gin Pro Cys His Val Glu Glu Leu Gly 
85 90 95 

Val Ala Gly Asn He Val Arg His He Pro Asp Arg Gly Ala Pro Thr 
100 105 HO 

Arg Ala Ser Glu Pro Val Ser Ala Ala Gly His Cys Pro Glu Trp Thr 
115 120 125 

Val Val Phe Asp Leu Ser Ala Val Glu Pro Ala Glu Arg Pro Ser Arg 
130 135 140 

25 Ala Arg Leu Glu Leu Arg Phe Ala Ala Ala Ala Ala Ala Ala Pro Glu 
145 150 155 160 



15 



20 



30 



35 



45 



Gly Gly Trp Glu Leu Ser Val Ala Gin Ala Gly Gin Gly Ala Gly Ala 
165 170 175 

Asp Pro Gly Pro Val Leu Leu Arg Gin Leu Val Pro Ala Leu Gly Pro 
180 185 190 

Pro Val Arg Ala Glu Leu Leu Gly Ala Ala Trp Ala Arg Asn Ala Ser 
195 200 205 

Trp Pro Arg Ser Leu Arg Leu Ala Leu Ala Leu Arg Pro Arg Ala Pro 
210 215 220 



40 Ala Ala Cys Ala Arg Leu Ala Glu Ala Ser Leu Leu Leu Val Thr Leu 
225 230 235 240 



Asp Pro Arg Leu Cys His Pro Leu Ala Arg Pro Arg Arg Asp Ala Glu 
245 250 . 255 

Pro Val Leu Gly Gly Gly Pro Gly Gly Ala Cys Arg Ala Arg Arg Leu 
260 265 270 
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Tyr Val Ser Phe Arg Glu Val Gly Trp His Arg Trp Val lie Ala Pro 
275 280 285 

Arg Gly Phe Leu Ala Asn Tyr Cys Gin Gly Gin Cys Ala Leu Pro Val 
5 290 295 300 

Ala Leu Ser Gly Ser Gly Gly Pro Pro Ala Leu Asn His Ala Val Leu 
305 310 315 320 

10 Arg Ala Leu Met His Ala Ala Ala Pro Gly Ala Ala Asp Leu Pro Cys 

325 330 335 



15 



Cys Val Pro Ala Arg Leu Ser Pro lie Ser Val Leu Phe Phe Asp Asn 
340 345 350 

Ser Asp Asn Val Val Leu Arg Gin Tyr Glu Asp Met Val Val Asp Glu 
355 360 365 



Cys Gly Cys Arg 
20 370 
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What is claimed is: 

1. A morphogen-enriched infant formula composition. 

2. The morphogen-enriched infant formula of claim 1 wherein 
said formula is milk-based. 

3. The morphogen-enriched infant formula of claim 1 wherein 
said formula is soy-based. 

A # The morphogen-enriched infant formula of claim 1 wherein 
said formula is adapted for preterm infants. 



5. The morphogen-enriched infant formula of claim 1 wherein 
15 said formula is adapted for low birth weight infants. 

6. A morphogen-enriched dietary composition for individuals 
at risk for tissue damage due to protein-energy 
malnutrition. 

20 

7. The morphogen-enriched dietary composition of claim 6 
wherein said individual is at risk for tissue damage 
from starvation, dehydration, anorexia nervosa or 
trauma* 

8. The morphogen-enriched dietary composition of claim 6 
wherein said individual is at risk for tissue damage 
from a malabsorption-malnutrition disorder. 

30 9. The morphogen-enriched dietary composition of claim 8 

wherein malabsorption-malnutrition disorder results from 
a digestive or intestinal fistula, short bowel, 
gastrointestinal disorder or hyper catabolism. 



25 
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10. The morphogen-enricbed dietary composition of claim 6 
wherein said individual is at risk for 
malnutrition-malabsorption induced tissue damage 
following radiotherapy, chemotherapy or surgery. 

5 

11. A morphogen-enricbed dietary composition for individuals 
at risk for tissue damage due to altered metabolism 
function. 

10 12. The morphogen-enricbed dietary composition of claim 11 

wherein said individual at risk is a pregnant, 
lactacting or post-menopausal female. 

13. The morphogen-enriched dietary composition of claim 11 
15 wherein said individual at risk is an aged individual. 

14. The composition of claim 1, 6 or 11 wherein said 
morphogen is associated with a controlled release 
componen, adapted such that the morphogen is released in 

20 a controlled maimer lower in the gastrointestinal tract. 

15. The composition of claim 1, 6 or 11 adapted for enteral 
administration. 

25 16. The composition of claim 1, 6 or 11 adapted for aerosol 
administration. 

17. The composition of claim 1, 6 or 11 adapted for nasal 
administration. 



30 



18. The composition of claim 6 or 11 formulated as a solid. 
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19. The composition of claim 18 wherein said solid is a 
tablet, troche or lozenge. 

20. The composition of claim 1, 6 or 11 formulated as a 
5 liquid. 

21. The composition of claim 20 wherein said liquid is a 
beverage or a syrup. 

10 22* The composition of claim 1, 6 or 11 wherein said 

morphogen is associated with a morphogen-solubilizing 
molecule. 

23. The composition of claim 22 wherein said molecule is 
15 casein or a derivative, salt or analog thereof. 

24. The composition of claim 22 wherein said molecule 
comprises part or all of a morphogen pro domain. 

20 25. The composition of claim 1, 6 or 11 wherein said 

morphogen comprises an amino acid sequence sharing at 
least 70% homology with one of the sequences selected 
from the group consisting of: OP-1, OP-2, CBMP2, 
BMP3(fx), Vgl(fx), Vgr(fx), DPP(fx), GDF-l(fx) and 

25 60A(fx). 

26. The composition of claim 25 wherein said morphogen 
comprises an amino acid sequence sharing a last BOX 
homology with one of the sequences selected from the 
30 group consisting of: OP-1, 0P-2, CBMP2, BMP3(fx), 

BMP5(fx), BMP6(fx), Vgl(fx), Vgr(fx), DPP(fx), GDF-l(fx) 
and 60A(fx). 



WO 94/03075 



PCIYUS93/07190 



- 147 - 

27. The composition of claim 1, 6 or 11 wherein said 
morphogen comprises an amino acid sequence having 
greater than 60% amino acid identity with the sequence 
defined by residues 43-139 of Seq. ID No. 5 (hOPl). 

5 

28. The composition of claim 27 wherein said morphogen 
comprises an amino acid sequence having greater than 652 
amino acid identity with the sequence defined by 
residues 43-139 of Seq. ID No. 5 (hOPl). 

10 

29. The composition of claim 28 wherein said morphogen 
comprises an amino acid sequence defined by residues 
43-139 of Seq. ID No. 5 (hOPl), including allelic and 
species variants thereof. 

15 

30. The composition of claim 1, 6 or 11 wherein said 
morphogen comprises an amino acid sequence defined by 
Generic Sequences 1, 2, 3, 4, 5 or 6 (Seq. ID Nos. 1, 2, 
3, 4, 30 or 31). 

20 

31. The composition of claim 1, 6 or 11 wherein said 
morphogen comprises an amino acid sequence defined by 
OPX (Seq. ID No. 29). 

25 32. The composition of claim 1, 6 or 11 wherein said 

morphogen comprises an amino acid sequence encoded by a 
nucleic acid that hybridizes under stringent conditions 
with a DNA defined by nucleotides 1036-1341 of Seq. ID 
No. 16 or nucleotides 1390-1695 of Seq. ID No. 20. 

30 
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33. A method for improving the human milk mimetic 
characteristics of an infant formula, the method 
comprising the step of adding a morphogenically 
effective concentration of a morphogen to said formula 

5 prior to providing said formula to an infant. 

34. A method for enhancing tissue viability in a 
malnourished individual, the method comprising the step 
of providing to said individual a morphogenically 

10 effective concentration of a morphogen. 

35. A method for enhancing tissue viability in an individual 
having altered metabolic function, the method comprising 
the step of providing to said individual a 

15 morphogenically effective concentration of a morphogen. 

36. The method of claim 35 wherein said individual is a 
pregnant, lactating or postmenopausal female. 

20 37. The method of claim 35 wherein said individual is an 

aged individual. 

38. The method of claim 34 or 35 wherein said morphogen is 
provided to said individual as part of a food 

25 - formulation. 

39. The method of claim 34 or 35 wherein said morphogen is 
provided to said individual as a dietary supplement. 
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40. The method of claim 33, 34 or 35 wherein said morphogen 
comprises an amino acid sequence sharing at least 70% 
homology vith one of the sequences selected from the 
group consisting of: OP-1, OP-2, CBHP2, BMP3(fx), 

5 Vgl(fx), Vgr(fx), DPP(fx), GDF-l(fx) and 60A(fx). 

41. The method of claim 40 vherein said morphogen comprises 
an amino acid sequence sharing a last 80% homology vith 
one of the sequences selected from the group consisting 

10 of: OP-1, OP-2, CBMP2, BMP3(fx), BMP5(fx), BMP6(fx), 

Vgl(fx), Vgr(fx), DPP(fx), GDF-l(fx) and 60A(fx). 

42. The method of claim 33, 34 or 35 vherein said morphogen 
comprises an amino acid sequence having greater than 60% 

15 amino acid identity vith the sequence defined by 

residues 43-139 of Seq. ID No. 5 (hOPl). 

43. The method of claim 42 vherein said morphogen comprises 
an amino acid sequence having greater than 65% amino 

20 acid identity vith the sequence defined by 

residues 43-139 of Seq. ID No. 5 (hOPl). 

44. The method of claim 43 vherein said morphogen comprises 
an amino acid sequence defined by residues 43-139 of 

25 Seq. ID No. 5 (hOPl), including allelic and species 

variants thereof* 

45. The method of claim 33, 34 or 35 vherein said morphogen 
comprises an amino acid sequence defined by Generic 

30 Sequences 1, 2, 3, 4, 5 or 6 (Seq. ID Nos. 1, 2, 3, 4, 

30 or 31). 
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46. The method of claim 33, 34 or 35 wherein said morphogen 
comprises an amino acid sequence defined by OPX (Seq. ID 
No. 29). 

47. The method of claim 33, 34 or 35 wherein said morphogen 
comprises an amino acid sequence encoded by a nucleic 
acid that hybridizes under strigent conditions with a 
DNA defined by nucleotides 1036-1341 of Seq. ID No. 16 
or nucleotides 1390-1695 of Seq. ID No. 20. 



48. The invention of claim 1, 6, 11, 33, 34 or 35 wherein 
said morphogen comprises a dimeric protein species 
complexed with a peptide comprising a pro region of a 
member of the morphogen family, or an allelic, species 

15 or other sequence variant thereof. 

49. The invention of claim 48 wherein said dimeric morphogen 
species is noncovalently complexed with said peptide. 

20 50. The invention of claim 48 or 49 wherein said dimeric 
morphogen species is complexed with two said peptides. 

51. The invention of claim 48 or 49 wherein said peptide 
comprises at least the first eighteen amino acids of a 

25 sequence defining said pro region. 

52. The invention of claim 51 wherein said peptide comprises 
the full length of said pro region. 
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The invention of claim 48 or 49 wherein said peptide 
comprises a nucleic acid that hybridizes under stringent 
conditions with a DNA defined by nucleotides 136-192 of 
Seq. ID No. 16, or nucleotides 157-211 of Seq. ID No. 
20. 

The invention of claim 48 or 49 wherein said complex is 
further stabilized by exposure to a basic amino acid, a 
detergent or a carrier protein. 
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